contribution to furthering science diplomacy. and Cooperation 
and science diplomacy. In 2021, the award was 
renamed after David and Betty Hamburg to 
recognize their unparalleled commitment to the 
significant role of science diplomacy to advance 
science, human rights, peace, and cooperation. 
Monetary prize of $10,000 


Commemorative plaque 
of their nomination. 
Advertorial 


Since the Pohang University of Science and Technology in South Korea, commonly 
called POSTECH, opened its doors in 1986, materials science and engineering research 
has been pivotal to its academic mission. POSTECH has eight institutes dedicated to 
materials research whose work regularly appears in top academic journals, and the 
faculty there are passionate about generating breakthroughs that transform how new 
technologies are developed. 

The impact of POSTECH’s materials research has the potential to transform 
everyday life, from the development of lighter, more energy-efficient and less 
expensive electronic devices to a reimagining of how we tackle human disease. 

Materials scientist Moon-Ho Jo is the director of the Center for Epitaxial van der 
Waals Quantum Solids at POSTECH (1), a part of South Korea's Institute for Basic 
Science (IBS) that is dedicated to exploring new types of atomically thin van der Waals 
(vdW) materials. 

At Jo's center, researchers will be looking to create novel classes of vdW materials 
to serve as platforms for next-generation quantum technologies. He explains that two- 
dimensional (2D) vdW materials are layered and can be stacked in different ways to 
form solids with new material properties, ranging from insulators and semiconductors 
to semimetals and superconductors. “I have set out to create heteroepitaxial vdW 
solids, where their crystal lattices and symmetries are artificially moulded with atomic 
precision, and can be translated for use in real-life devices,” he says. 


The POSTECH team (left to right): Jun Sung Kim, Jonghwan Kim, Jong Kyu Kim, Moon-Ho Jo, Gil-Ho Lee 


The future of new materials at POSTECH 


Putting a new spin on magnets 

Condensed matter physicist Jun Sung Kim works in the Center for Artificial Low 
Dimensional Electron Systems, where he focuses on researching the novel properties 
of 2D vdW magnets. 

Kim explains that these topological materials, whose electronic bands are 
described as “twisted,” are currently one of the most interesting material classes. 
Topological materials have become important in spintronic applications, in which 
the electron’s spin information is used rather than simply its charge information, 
explains Kim. It is expected that spintronics research will lead to the development of 
electronic devices that have increased data storage capabilities and faster processing 
speeds but consume less power—a change that will help reduce energy consumption 
worldwide and put the use of computing technologies on a more environmentally 
sustainable path. “We believe that novel properties in a new class of magnets could 
provide breakthroughs for developing faster, nonvolatile, and more energy-efficient 
information technology,” he says. 

Kim's research led to a recent paper published in Nature (2), which describes 
his team’s discovery that the magnetic semiconductor Mn3Si2Te6 exhibits 
unprecedentedly large angular magnetoresistance, often called colossal angular 
magnetoresistance (CAMR), due to its unique interplay between topological electronic 
structure and magnetism. “We hope that our series of discoveries on new topological 
van der Waals magnets will provide a material platform to investigate the rich 
topological aspects of two-dimensional magnets and their functionalities for next-generation 
electronics or spintronics,” he says. 


Optoelectronic devices 
Researcher Jonghwan Kim leads a group (3) at POSTECH’s Laboratory of 
Optoelectronics Materials and Devices. For him and his team, vdW materials provide 
exciting opportunities to explore new optical science and technology developments. 

The crystal structure of these 
semiconductors is composed of a single or 
just a few atomic layers, making them the 
thinnest form of 2D materials in nature, he 
says. They have high light absorption and 
emission properties and can be flexibly and 
precisely stacked to create new materials. 

Kim's research team recently developed 
a deep-ultraviolet (DUV) light-emitting diode 
(LED) using a new material. This DUV LED 
was designed to emit UV light at a short 
wavelength of 200 nm-280 nm, which could 
potentially be used to destroy disease in 
the human body. The team used hexagonal 
boron nitride (hBN), which is a vdW layered material like graphite, to fabricate an LED 
device that emits strong UV light. 

“Throughout history, there have been many examples of new materials enabling 
groundbreaking change in real life,” says Kim. In the future, Kim looks forward to 
the possibility of applying the technology in human health care to tackle dangerous 
viruses, such as COVID-19, by sterilizing these pathogens using a short wavelength of 
UV light that has a low level of skin penetration and is less harmful to the human body. 


Next-generation electronics 

Based in the Department of Materials Science and Engineering at POSTECH, Jong Kyu 
Kim leads a cutting-edge team focused on researching materials breakthroughs that 
could support the development of more electronic devices based on 2D materials. 
Kim is focusing on advancing our understanding of the growth mechanism of hBNs 
by metal-organic chemical vapor deposition (MOCVD), which he expects will be a 
step toward developing the production of high-quality hBN building blocks for use 
in a variety of silicon-based electronic devices (4). “Recently, an hBN layer was 
successfully grown on a 2-inch, high-electron-mobility transistor, and it performed 
well,” he says. 

Kim is confident that hBN could become a new material platform for supporting 
the development of smaller, faster, and cheaper electronic devices, and could help to 
overcome the current scaling limitation of silicon-based devices. “2D materials with 
a thickness of just a few nanometers could replace current electronic devices, while 
showing a higher performance and offering additional functionality.” 


“Researchers at POSTECH 
combine a passion to 
create real-life applications 
for their research with 
a fundamental desire to 
better understand how our 
natural world performs.” 
Last year, Kim's team was also featured in a leading journal (5) when they proposed 
a way to efficiently produce hydrogen fuel via water electrolysis using cheap and 
readily available nickel as an electrocatalyst. “This research is significant in that it 
provides the scholarly foundation for high performance and commercialization of a 
sustainable hydrogen energy conversion system,” explains Kim. 


Diving deeper into physical properties 
Researchers at POSTECH combine a passion to create real-life applications for their 
research with a fundamental desire 
to better understand how our natural 
world performs. Physicist Gil-Ho Lee 
(6) leads a quantum nanoelectronics 
lab at POSTECH. His research focuses 
on experimental studies on quantum 
devices based on vdW materials, 
including graphene, superconductors, 
and topological and magnetic materials. 
Lee analyses the quantum behaviour 
of devices made of these materials and 
measures their electrical properties 
at low temperatures, to determine 
how these lessons could inform the 
development of computing or sensors at the nano level. He says: “As the field of 
electromagnetism in physics opened up the field of electrical engineering, so I hope 
quantum mechanics will open up the field of so-called quantum engineering.” 
Recently, Lee’s team, together with collaborators, developed quantum sensors for 
microwave and infrared lights. This work, in which they proposed using graphene- 
based quantum sensors to exploit graphene’s minute electronic heat capacity, was 
published in leading international journals (7, 8). “This research is a good example 
of how a fundamental understanding of quantum materials can be used for sensor 
applications,” explains Lee. 
Together with his colleagues across POSTECH, Lee is looking for answers that could 
help researchers understand the quantum physics of nanodevices and how these 
solutions could support us in meeting some of the greatest challenges facing society. 
676 News at a glance 


679 New subvariants are masters of 
immune evasion 

Vaccines and prior infection still prevent 
severe disease from new SARS-CoV-2 strains 
By G. Vogel 


680 Better lipids to power next 
generation of mRNA vaccines 

New delivery systems aim to increase vaccine 
potency and reduce side effects 

By E. Dolgin 

PODCAST 


682 Wrestling with bird flu, Europe 
considers once-taboo vaccines 

To lessen the toll of culling, some countries 
launch vaccine trials despite trade 
implications and public health risks 

By E. Stokstad 


683 Pandemic delays continue to plague 
polar science 

Backlog of NSF-funded work will allow few 
new projects By P. Voosen 


684 Odds for winning NSF grants improve 
as competition eases 

Agency reports a 17% drop in number of 
grant applications since 2011, troubling some 
observers By J. Mervis 
685 ‘Hot’ climate models exaggerate 
Earth impacts 

U.N. report authors say researchers 
should avoid suspect models By P. Voosen 


686 The bias hunter 

Itiel Dror is determined to reveal the role 
of bias in forensics, even if it sparks outrage 
By D. Starr 


INSIGHTS 


692 The limits of forest carbon 
sequestration 

Current models may be overestimating 
the sequestration potential of forests 
By J. K. Green and T. F. Keenan 

REPORT p. 758 


694 Immune cells impede repair 
of old neurons 

Interfering with age-related neuroimmune 
interactions promotes nerve regeneration 
By N. Gaudenzio and R. S. Liblau 

RESEARCH ARTICLE p. 715 


695 Mapping cell types 
across human tissues 

Single-cell analyses reveal tissue-agnostic 
features and tissue-specific cell states 

By Z. Liu and Z. Zhang 


RESEARCH ARTICLES pp. 711, 712, 713, 
& 10.1126/SCIENCE.ABO0510 


697 Recording bacterial responses to 
changes in the gut environment 

A CRISPR-based tool reveals intestinal 
microbiota gene expression through time 
By L. Zahavi and E. Segal 

RESEARCH ARTICLE p. 714 


698 Beating natural proteins 
at filtering water 

Artificial fluorous channels outperform 
aquaporins in water permeation By Y. Shen 
REPORT p. 738 


699 Anomalous fluid flow in 
quantum systems 

New regimes of hydrodynamics are 
probed with synthetic quantum matter 
By A. Morningstar and W. Bakr 

RESEARCH ARTICLES pp. 716 & 720. 


701 Investor-state disputes threaten 
the global green energy transition 
Global action on climate change could 
generate upward of $340 billion in legal 
claims from oil and gas investors 

By K. Tienhaara et al. 


704 Rethinking genetic disease 
Carriers of the fragile X premutation can 
experience medical conditions of their own 
By B. Sullivan 


705 The becoming of the human brain 
A neuroscientist traces the development 
of the body's most complex organ 

By R. D. Fields 


13 MAY 2022 » VOL 376 ISSUE 6594 671 




The Michelson Prizes: Next Generation Grants 
support young investigators applying disruptive 
research concepts to significantly advance the 
A rigorous and competitive international search 
will identify the most innovative projects. 
We encourage applications from the full 
spectrum of scientific disciplines 
It ain’t over ’til it’s over 


By H. H. Thorp 
A survey of cell types across tissues as part of 
the Human Cell Atlas, mapped with single-cell 
transcriptomics in three papers in this issue, 
2022 Mike Hogg Award and Lecture 
Recognizing May-Britt Moser, Ph.D. 


MD Anderson Cancer Center congratulates May-Britt Moser, Ph.D., 
on being named the 2022 Mike Hogg Award recipient. 


With this honor, MD Anderson recognizes Moser’s research, together with 
long-term collaborator Edvard Moser, on fundamental cognitive functions 
shared across animals, with a mechanistic focus on how the brain calculates 
self-position and how the information is stored in memory. 


Moser is a professor of Neuroscience, founding director of the Center 
for Neural Computation, and co-director of the Kavli Institute for 
Systems Neuroscience at the Norwegian University of Science and 
Technology. She was awarded the Nobel Prize in Physiology or 
Medicine in 2014, together with E. Moser and John O'Keefe. 


Moser’s lecture will be delivered virtually. The event is sponsored by the Mike 
Hogg Fund and hosted by MD Anderson’s Division of Education and Training. 


THE UNIVERSITY OF TEXAS 
MD Anderson Cancer Center 
EDITORIAL 


It ain’t over ’til it’s over 


The Biden administration is sheepishly waving 
a checkered flag on the pandemic. If you look 
closely, you can see its members cringing as they 
do so. Chief Medical Advisor Anthony Fauci told 
the PBS Newshour that the United States was “out 
of the pandemic phase” and then walked it back, 
saying he meant that the “acute component” of 
the pandemic phase was over. President Biden attended 
the likely superspreading White House Correspondents’ 
Dinner last weekend but skipped cocktails and the meal, 
opting to just give his talk. Fauci avoided the whole affair. 
Meanwhile, Vice President Harris continued to isolate 
after her positive COVID-19 test, and many members of 
Congress and the administration announced positive test 
results as well. All of this happened 
while the White House allowed a 
renegade federal judge in Florida 
(where else?) to end the nationwide 
mask mandate without much of a 
fight. These mixed messages have 
been emanating from the admin- 
istration for months now, and al- 
though those with resources have 
tools to manage COVID-19, care 
needs to be taken that those with- 
out such means are not forgotten. 
When Biden pledged to “follow 
the science,” it was hard to imagine that the country 
could have ended up here. But the administration 
made a big bet that vaccines would provide sterilizing 
immunity and end the pandemic, allowing it to move 
on to other priorities. Leaving behind the insanity of 
ivermectin, hydroxychloroquine, and bleach was cer- 
tainly a great step forward. However, evolution has 
had other plans, and variants of severe acute respira- 
tory syndrome coronavirus 2 (SARS-CoV-2, the virus 
that causes COVID-19) have kept the pandemic going. 
This left the White House in a very tight spot: There 
was little political will to keep pushing nonpharma- 
ceutical interventions, yet the pandemic was far from 
over. Add to this mounting inflation worries and con- 
cerns about the war in Ukraine, and the response has 
been a clumsy pivot to a message that politicians al- 
ways turn to: personal responsibility. Get vaccinated, 
get boosted, wear a mask, get a prescription for the 
antiviral Paxlovid—if you want to. This may be fine if 
you have a healthy immune system, great health insur- 
ance, and the ability to navigate the US health care 
system. But what about everyone else? 

COVID-19 is at a similar place to where the HIV/AIDS 
global pandemic was when the antiretroviral drugs 
came along. Yale epidemiologist Gregg Gonsalves told 
me about important parallels between both pandem- 
ics. “The HIV epidemic didn’t go away,” he said. “It just 
went to where people could ignore it. It went into the 
rural South, it went to communities that were already 
facing disparities in health.” At that time, confusion 
between medicine and public health was also an im- 
portant factor. “The discourse shifting to private choice 
and private adjudication of risk is really not what pub- 
lic health science is,” he said. “We 
work in populations. And if we’re 
talking about medicine, it’s about 
private risk and private choices.” 

This all hit home for me 
when—while I was preparing this 
editorial—I tested positive for 
COVID-19. After writing about the 
virus for two-and-a-half years, it 
was in my body. But I’d had four 
shots of the vaccine to bolster my 
already robust immune system, 
a supply of rapid test kits, and a 
prescription for Paxlovid from my doctor. The fever was 
gone within a few hours of taking the antiviral, and I 
tested negative a few days later. Great news for me, but 
not for those who don’t have these resources. 

Legendary public health leader Paul Farmer summed 
up this situation well: “Those whose lives are rarely 
touched by structural violence are uniquely prone to 
recommend resignation as a response to it,” he said. “In 
settings in which all of us are at risk, as is sometimes 
true of contagion shared through the air we breathe, we 
must also contemplate containment nihilism—the atti- 
tude that preventing contagion simply isn’t worth it.” 

SARS-CoV-2 is rapidly mutating and recombining, and 
more variants and subvariants—potentially more patho- 
genic—are on the horizon. The world is still barely vac- 
cinated, and even in wealthy countries like the United 
States, resources are inequitably distributed. It absolutely 
ain’t over. And this is no time to drop the ball. 

-H. Holden Thorp 


H. Holden Thorp 
Editor-in-Chief, 
Science journals. 
hthorp@aaas.org; 
@hholdenthorp 


Published online 5 May 2022; 10.1126/science.abq8460 
“We have to kind of hold hope and grief 
at the same time.” 


Jacquelyn Gill of the University of Maine, Orono, quoted in the Associated Press 
on the mind frame of climate scientists. 


IN BRIEF Edited by Lila Guterman 


SEISMOLOGY 


A martian big one 


NASA reported this week that its InSight lander on Mars hit the 
seismic jackpot, recording a magnitude 5 marsquake, by far 
the largest ever seen. Since InSight landed in 2018, seismologists 
have dreamed of such a quake, large enough to allow 
waves to encircle the planet’s surface, providing InSight’s seismic 
station an ultraprecise epicenter location and with it, a 
sort of skeleton key for the planet’s interior. Using smaller quakes, 
InSight’s team has managed to chart the thickness of the martian 
crust, mantle, and core; with the new quake, this picture is expected 
to grow far more precise. The quake was well timed, coming 3 days 
before the lander entered a hibernating “safe mode” because of low 
power from its dust-covered solar panels. The spacecraft is likely to 
expire in the coming weeks as martian winter approaches. 


U.S. limits use of J&J vaccine 


COVID-19 | The COVID-19 vaccine made 
by Johnson & Johnson (J&J) should only 
be given to people who cannot receive 
any other vaccine, the U.S. Food and 
Drug Administration (FDA) said on 5 May. 
The move is related to thrombosis with 
thrombocytopenia syndrome (TTS), a 
rare, serious clotting disorder that has 
been associated with the vaccine. FDA 
has confirmed 60 cases of TTS, nine of 
them fatal, after 18.7 million doses of J&J 
were administered. The vaccine’s benefits 
still outweigh the risks, the agency says, 
but alternatives that don’t cause TTS are 
readily available. Scientists have not been 
able to pin down why the vaccine—and a 
similar one, made by AstraZeneca—cause 
the syndrome. 


Verdict in China Initiative case 


RESEARCH SECURITY | An applied math 
professor at Southern Illinois University 
(SIU), Carbondale, last week was found 
guilty of failing to report a Chinese bank 
account on his U.S. tax returns but cleared 
of charges that he committed grant fraud. 
The 4 May verdict against Mingqing Xiao 
came in the fourth jury trial resulting from 
the China Initiative, a controversial U.S. law 
enforcement campaign that has led to the 
prosecution of some two dozen U.S. academics, 
most of them of Chinese ancestry. “It’s 
a massive victory for Ming because the 
government was not able to prove that 
Ming did anything wrong in applying for 
his [National Science Foundation] grant,” 
said Ed Benyas, an SIU music professor who 
helped organize daily vigils to the Benton, 
Illinois, courthouse, where supporters wore 
“I stand with Ming” buttons. Xiao, on paid 
administrative leave from SIU, faces up to 
5 years in prison and a substantial fine. 
Sentencing is set for 11 August. 


Donor pig heart harbored virus 


BIOTECHNOLOGY | The pig heart trans- 
planted into a human patient in a 
groundbreaking surgical experiment 
carried a porcine virus that may have 
contributed to the man’s death, MIT 
Technology Review reported last week. 
In January at the University of Maryland 
School of Medicine (UMSOM), David 
Bennett, a heart failure patient, received 
the heart of a genetically modified pig 
created by the company Revivicor. When 
UMSOM researchers announced Bennett’s 
death in March, they shared no details 
about its cause. But in a 20 April webinar, 
UMSOM surgeon Bartley Griffith said the 
team had detected porcine cytomegalo- 
virus in Bennett’s blood. The virus has been 
shown to damage pig hearts transplanted 
into baboons, leading to the primates’ 
death. The finding suggests immune rejec- 
tion of the organ was not to blame for 
Bennett’s death, and that thorough virus 
screening before transplant could improve 
survival time. 


Volunteers, AI bag 1000 asteroids 


PLANETARY SCIENCE | An army of 11,400 
citizen scientists joined forces with 
artificial intelligence (AI) to spot more 
than 1000 previously unknown asteroids. 
In 2019, researchers led by the European 
Space Agency enlisted volunteers to peruse 
more than 37,000 images taken by the 
Hubble Space Telescope over nearly 
20 years. Because the exposures were 
30 minutes long, asteroids appear as 
curved lines or streaks. The volunteers 
spotted more than 1000 of these traces, 
which were then used to train an AI system 
to find more, bumping the total to 1701 
in 1316 images, the team announced last 
week. Roughly one-third are known asteroids, 
but the rest are new, and are thought 
to be small lumps of rock in the main belt 
between Mars and Jupiter. Researchers 
will now try to assess their orbits, sizes, 
and rotation rates, to learn more about the 
rubble from which the planets formed. 


Asteroid trails appear curved or S-shaped in this 
image from the Hubble Space Telescope. 


This bridal veil stinkhorn 
fungus breaks down plant 
matter in the soil. 


Disgraced surgeon on trial 


MISCONDUCT | Paolo Macchiarini, the 
surgeon famed and then disgraced for 
implanting artificial tracheae seeded with 
stem cells into patients, took the stand in 
his trial for assault in Solna, Sweden, last 
week. Macchiarini performed at least eight 
of the transplants between 2011 and 2014. 
All recipients died except one, who had the 
implant removed. In 2016, the Karolinska 
Institute (KI) fired Macchiarini, who was 
later found guilty of fraud and research 
misconduct. Now, Swedish prosecutors 
have charged him with assault or “bodily 
harm due to negligence” related to three 
patients who received transplants at KI. 
Prosecutors argue Macchiarini performed 
the transplants with “reckless intent,” 
knowing complications plagued earlier 
recipients. But Macchiarini testified that 
he performed the surgeries with the 
knowledge and support of KI and that the 
surgeries were a last resort for the patients. 
The trial, which began on 27 April, is 
expected to conclude on 23 May. 


ECOLOGY 


Soil fungi keep 
ecosystems humming 


Mushrooms not only add color to the 
landscape, they help keep the natural 
world productive and stable. Places 
with a wider variety of soil fungi 
that break down plant matter, the 
so-called decomposers, are better able 
to cope with drought and other stressors, 
ecologists reported this week in Nature 
Ecology & Evolution. The researchers 
determined the fungal makeup of about 
700 soil samples from tropical, temperate, 
and polar climates. They mapped the 
diversity of different kinds of soil fungi 
onto 2 decades’ worth of satellite images 
of plant photosynthesis, a measure of the 
ecosystem’s productivity. Diverse fungal 
decomposers helped plants stay productive 
over time, but diversity of pathogenic 
fungi had the opposite effect, says lead 
author Manuel Delgado-Baquerizo, an ecosystem 
ecologist at the Spanish National 
Research Council. He and others suggest 
finding a way to enhance decomposer 
diversity may buffer against the negative 
effects of climate change. 


Stanford gets climate windfall 


PHILANTHROPY | Stanford University will 
use a $1.1 billion gift—the second largest 
to an academic institution in history—to 
create a school focused on climate change. 
The gift, from John Doerr, a Silicon 

Valley venture capitalist, and his wife 

Ann Doerr, board chair of Khan Academy, 
will establish a school with 90 faculty 
members, including many from existing 
departments; Stanford will add 60 more 
over a decade. Climate-focused schools 
are a trend in U.S. higher education, with 
Columbia University announcing a similar 
one in 2020. 




Pediatricians take stand on race 


BIOMEDICINE | The American Academy of 
Pediatrics (AAP), a professional association, 
announced last week it would eliminate 
race-based medicine in its recommendations 
and publications. Race has no use as a proxy 
for a person’s risk of health conditions or for 
other biological traits, AAP said in its policy 
announcement, noting that race-based medi- 
cine has increased racial health disparities. 
The group will revisit its “entire catalog,” 
including its guidance for newborns with 
jaundice, which currently says that East 
Asian race is a risk factor for the condition. 


U.S. preps for quantum hackers 


CRYPTOGRAPHY | Someday, quantum 
computers—exotic machines that can 
solve practical problems that would stymie 
conventional computers—will be able to 
crack the encryption algorithms used to 
protect internet communications. So, last 
week President Joe Biden’s administra- 
tion instructed federal agencies to prepare 
to shift to new algorithms that can resist 
a quantum attack. The memo envisions 
the decadelong shift beginning in 2024, 
when the first standard for such “post- 
quantum cryptography” should be set by 
the National Institute of Standards and 
Technology (NIST). Fortunately for internet 
companies, postquantum cryptography will 
involve changes mostly in software. Still, 
NIST researchers say the transition will be 
challenging and expensive. 


BIOLOGY 


Speed control keeps flocks in sync 


Large flocks of starlings doing their synchronized dances across the sky stay 
together without constantly bumping by adjusting to one another's subtle 
changes in speed, researchers have determined. Theoretical physicist Antonio 
Culla from the Italian National Research Council and colleagues recorded flocks 
of starlings and then experimented with computer models until they came up 
with one whose on-screen birds behaved like real birds. The secret, Culla’s team 
reported this week in Nature Communications, is having “marginal speed control”: 
The birds, which fly 8 to 18 meters per second, slow down or speed up a little to 
keep up with birds nearby but avoid large changes in speed, which could lead to the 
breakup of the flock, he says. The finding could one day help drones better coordinate 
their flight in swarms. 


NSF revs up innovation engines 


TECHNOLOGY TRANSFER | Details of the 
competition for the largest cash awards 
in National Science Foundation (NSF) 
history were announced last week. NSF 
hopes to spend $160 million over 
10 years on each of five “regional innovation 
engines” that will headline the 
agency’s new technology directorate. The 
competition kicks off this year as teams of 
university scientists, companies, and local 
governments vie for 50 $1 million starter 
grants. NSF is asking applicants for their 
plans to conduct user-inspired research, 
translate the results into high-tech prod- 
ucts and services, and train local people 
for the additional jobs that are created. 
The five most promising approaches will 
win the jackpot in a second round of 
funding next year. NSF says it will favor 
teams from areas “without well-established 
innovation ecosystems,” meaning long 
odds for traditional high-tech regions 

like Silicon Valley, Boston, and North 
Carolina’s Research Triangle Park. 


$334M penalty in patent case 


INTELLECTUAL PROPERTY | Last week, 
a Delaware jury ordered U.S. sequencing 
company Illumina, Inc. to pay $333.8 million 
to Chinese genomics firm BGI Group 
after finding Illumina infringed two of 
BGI’s patents for DNA sequencing technology. 
Illumina and Complete Genomics, 
Inc., a California-based arm of BGI, have 
each claimed the other infringed their 
“2-channel” technology patents, which 
speed sequencing by imaging bases in 
parallel. Illumina has said it plans to 
appeal the verdict. Meanwhile, the companies 
are also locked in patent battles 
in Denmark, Germany, Switzerland, and 
Turkey. The decision comes on the heels 
of another case in San Francisco, where 
a jury found BGI infringed a different 
Illumina patent and ordered it to pay 
$8 million. 


U.S. gun murders surged in 2020 


PUBLIC HEALTH | Firearm homicide rates 
in the United States increased by 35% from 
2019 to 2020, reaching their highest level 
since 1994, the U.S. Centers for Disease 
Control and Prevention reported this week 
in the Morbidity and Mortality Weekly 
Report. The rate of gun murders increased 
from 4.6 to 6.1 per 100,000 people as the 
COVID-19 pandemic took hold. The biggest 
increases were in Black boys and men ages 
10 to 44 and in American Indian/Alaska 
Native men ages 25 to 44. Rates also grew 
with increasing poverty. The firearm 
suicide rate also grew in 2020, by 2.5%; 
among American Indian/Alaska Natives, 
the rate grew by 41.8% even as gun murder 
rates in this group grew by 26.5%, less 
than the overall U.S. increase. 
A woman receives a COVID-19 vaccine in New Delhi in June 2021. The continuing evolution of SARS-CoV-2 is a challenge for vaccine developers. 


New subvariants are masters of immune evasion 


Vaccines and prior infection still prevent severe disease from new SARS-CoV-2 strains 


By Gretchen Vogel 


Once again, South Africa is at the 
forefront of the changing COVID-19 
pandemic. Epidemiologists and virologists 
are watching closely as 
cases there rise sharply again, just 
5 months after the Omicron variant 
caused a dramatic surge. This time, the 
drivers are two new subvariants of Omi- 
cron named BA.4 and BA.5, which the Net- 
work for Genomic Surveillance in South 
Africa first detected in January. 

The new strains didn’t have much of an 
impact initially, but over the past few weeks 
case numbers in South Africa jumped from 
roughly 1000 per day on 17 April to nearly 
10,000 on 7 May. A third subvariant called 
BA.2.12.1 is spreading in the United States, 
driving increases along the East Coast. 

It’s still unclear whether the new sub- 
variants will cause another global COVID-19 
wave. But like the earlier versions of Omi- 
cron, they have a remarkable ability to 
evade immunity from vaccines, previous 
infection, or both—a disturbing portent for 
the future of the pandemic and a potentially 
serious complication for vaccine developers. 

In most cases, vaccination or earlier
infection still seem to provide protection
from severe disease. “There’s no reason to
freak out,” says John Moore, an immunologist
at Weill Cornell Medicine. The new
strains are “an additional hassle,” he says, 
but “there’s no indication that they’re more 
dangerous or more pathogenic.” 

Hospitalizations in South Africa, for ex- 
ample, have increased, “but because it is 
starting from a very low level, it’s not cause 
for alarm,” says virologist Tulio de Oliveira 
of Stellenbosch University, who helped iden- 
tify BA.4 and BA.5. Numbers of patients in 
intensive care units are as low as they have 
been since the start of the pandemic, he 
says. “At the moment, we expect something 
similar to the Omicron BA.1 wave,” when 
hospitalization rates stayed manageable. 

The new superspreaders do, however, 
showcase the restless virus’ ability to find 
ways around the “immunity wall” built up 
over the past 2 years and to continue to 
circulate at high levels. Even if the new 
variants cause relatively little severe dis- 
ease, “it’s a numbers game,” says Leif Erik 
Sander, an infectious disease expert at 
the Charité University Hospital in Berlin;
enough new infections could still overwhelm
health systems.

All three new strains share key muta- 
tions with the BA.2 strain of Omicron, 
which, like BA.1, emerged in southern Af- 
rica in October 2021. Initial studies by de 
Oliveira and Alex Sigal, an infectious dis- 
ease expert at the Africa Health Research 
Institute in Durban, suggest BA.4 and BA.5 
can elude the immunity of patients who 
were infected with the BA.1 strain, which 
in South Africa caused a much larger wave 
than BA.2. That may be in part because 
immunity has waned since South Africa’s 
BA.1 wave peaked in December. People who 
were both vaccinated and infected had 
somewhat stronger protection, de Oliveira 
and Sigal reported in a 2 May preprint. 

All three new variants have mutations that 
alter a key amino acid called L452, which may 
help explain their ability to dodge immunity. 
L452 is part of the receptor-binding domain, 
the part of the spike protein that locks onto 
cells, enabling infection. The domain is also 
a key target for protective antibodies. 

The Delta variant that caused devastating 
surges around the world in 2021 had mutations
in L452 as well, so many scientists
have been watching this hot spot carefully,
including immunologist Yunlong Richard 
Cao of Peking University. On 11 April, Cao 
says, he and his colleagues noticed a pat- 
tern: New Omicron sublineages from New 
York, Belgium, France, and South Africa all 
had changes in L452. “The independent ap- 
pearance of four different mutations at the 
same site? That’s not normal,” Cao says. The 
researchers suspected it was the virus’ re- 
sponse to the high levels of immunity gen- 
erated by the huge Omicron waves. 

They immediately started to make cop- 
ies of the spike protein based on the new 
sequences and test how well different anti- 
bodies could block those proteins, prevent- 
ing them from binding to cells. They used 
sera from 156 vaccinated and boosted sub- 
jects, including some who had recovered 
from either BA.1 or severe acute respiratory 
syndrome (SARS), the corona- 
virus disease that caused a 
deadly global outbreak almost 
2 decades ago. Like the South 
African team, they found that 
blood from patients who had 
been infected with BA.1 had 
only weak ability to neutralize 
BA.4 and BA.5; the same was 
true for BA.2.12.1. Even less ef- 
fective were sera from people 
who had previously been infected with 
SARS and then vaccinated against COVID- 
19, they reported in a 2 May preprint. 

The latter finding was surprising. Pre- 
vious work by Linfa Wang, a bat corona- 
virus researcher at the Duke-NUS Medical 
School in Singapore, had shown patients 
who had recovered from SARS and were 
then vaccinated had strong protection 
against earlier SARS-CoV-2 variants—and 
even some related animal viruses—a find- 
ing that seemed to hold clues to develop- 
ing vaccines effective against multiple 
coronaviruses, including those that might 
trigger the next pandemic. But the new 
mutations apparently helped the Omicron 
subvariants evade those previously power- 
ful antibodies. 

Wang notes, however, that the subjects 
in the new study were all vaccinated with 
CoronaVac, a Chinese vaccine made from 
inactivated virus. Subjects in his study were 
vaccinated with messenger RNA (mRNA) 
vaccines, which might provide a more po- 
tent response to the new strains, he says. 
But Wang agrees that Omicron’s knack for 
immune escape is dramatic. Based on its 
immunological profile, it “should be called 
SARS-3,” he says—an entirely distinct virus. 

Omicron’s rapid evolution creates diffi- 
cult decisions for vaccine- and policymak- 
ers about whether to shift to a new set 
of vaccines or stick with the current formulations,
which are based on the virus
that emerged in Wuhan, China, more than 
2 years ago. Moderna has tested two “bi- 
valent” versions of its mRNA vaccine, con- 
taining the ancestral strain and either the 
Beta variant—which spread in South Africa 
for a while in 2021 but is now gone—or the 
Omicron BA.1 variant. The company has 
not yet reported data on how well they 
might protect against the new subvariants. 

Pfizer, the other mRNA vaccine pro- 
ducer, has tested the efficacy of a booster 
and a primary vaccine based on BA.1. Re- 
sults are expected by the end of June. The 
U.S. Food and Drug Administration has 
scheduled a meeting for 28 June to analyze 
available data and make vaccine recom- 
mendations for the fall. 

The limited protection that BA.1 in- 
fection provided against the new sub- 
variants in lab studies has al- 
ready raised questions about 
how useful the new Omicron- 
specific vaccines might be. 
Wang says the virus is evolving 
too quickly for strain-specific 
vaccines to keep up. Instead, a 
broad cocktail of monoclonal 
antibodies targeting different
strains might be the best
way forward, he says. 

Such a shot could prevent infections 
for several months in those vulnerable 
to severe disease, including immuno- 
compromised people who don’t respond to 
vaccines. Protecting that group is crucial, 
he notes, because many researchers sus- 
pect new variants emerge during long-term 
infections in people whose immune sys- 
tems fail to clear the virus. The main 
hurdle, Wang says, is cost: A dose of mono- 
clonal antibodies is about $1000 per pa- 
tient, he notes, “but if someone could find 
a way to lower that to $50 or $100,” the 
approach could be cheaper than constantly 
updating vaccines. 

Kristian Andersen, who studies viral 
evolution at Scripps Research, draws a 
sobering lesson from the newest Omicron 
variants. Although we don’t know what fu- 
ture variants will look like, he says, “we can 
be certain that they’ll continue to be more 
and more capable of immune escape,” pos- 
sibly leading to lower protection against 
not just infection, but also against severe 
disease. “We need to focus on broadening 
our immunity,” he says. 

It’s far from clear what kind of vaccine 
might prompt that broadened immunity, 
but “we really, really need to get going” to 
figure that out, Andersen says. “Simply let- 
ting the virus do what viruses do—continue 
to infect us, and likely several times a year— 
just isn’t an option in my playbook.” 


VACCINE DEVELOPMENT 


Better lipids to power next generation of mRNA vaccines


New delivery systems aim to increase vaccine potency
and reduce side effects


By Elie Dolgin 


As any dietician will tell you, some 
fats are good—and that is surely 
true of the little fatty balls found 
in two of the world’s most widely 
used COVID-19 vaccines. Known as 
lipid nanoparticles (LNPs), these 
tiny bubbles of fat encase messenger RNA 
(mRNA) that encodes a viral protein, help- 
ing ferry it into cells and shield it from 
destructive enzymes. The technology was 
key to the success of COVID-19 shots from 
Moderna and the Pfizer-BioNTech collabo- 
ration. But as beneficial as these fats are, 
there is plenty of room for improvement. 

The nanoparticles are a major source of 
unwanted side effects when they spread 
through the body, triggering the aches 
and inflammation many people experience 
after vaccination. They do a poor job of 
unloading their cargo once inside cells, a 
necessary step for the proteinmaking ma- 
chinery to turn the mRNA sequences into 
immune-priming signals. And because 
they tend to fall apart when warm, they 
have to be stored at low temperatures, lim- 
iting their global use. 

“This is a system that clearly has legs,” 
says biochemist Pieter Cullis of the Univer- 
sity of British Columbia (UBC), Vancouver, 
who created the first LNPs, but “we still 
need to increase the efficiency of LNPs— 
that’s for sure.” 

A new generation of LNPs with greater 
potency, fewer side effects, increased sta- 
bility, and more precise tissue-targeting 
properties is now under development at big 
pharma and biotech startups. Big money 
is at stake: These improved nanoparticles 
could lead to better mRNA vaccines for 
COVID-19 and other diseases. They might 
also help mRNA deliver on its promise as a 
therapeutic tool to treat disease. “There are 
innovations on delivery that certainly could 
change the game,” says Philip Santangelo, 
a biomedical engineer at the Georgia In- 
stitute of Technology who has collaborated 
with several mRNA companies. 

Cullis and colleagues developed the first 
LNPs about 20 years ago to carry gene- 
silencing drugs into cells. He and others 
later tailored the LNPs’ four lipid compo- 
nents to deliver disease-correcting mRNA 
to faulty cells. Now that they are being put 
to a new use, in vaccines, “There is still so 
much optimization and development that 
needs to happen,” says UBC bioengineer 
Anna Blakney, co-founder of the RNA vac- 
cine company VaxEquity. And when it comes 
to understanding how cells inter- 
act with the nanoparticles, “it’s just this 
big question mark,” she adds. 

One clue emerged earlier this year 
when Genentech scientists showed how 
nanoparticles activate a particular in- 
flammatory pathway, the interleukin-1 
axis, which is critical to generating 
protective immune responses but 
can also spur side effects. Among the 
LNPs tested, one made with SM-102, 
an “ionizable” lipid that helps bind 
and package mRNA into LNPs, proved 
an especially strong instigator of this 
pathway. That could help explain why 
Moderna’s shot, which relies on SM- 
102, is both highly effective and prone 
to making people feel icky. 

The Genentech team did not evalu- 
ate the comparable lipid found in the 
Pfizer-BioNTech vaccine. But Mohamad- 
Gabriel Alameh and colleagues from the 
University of Pennsylvania Perelman 
School of Medicine tested a closely re- 
lated one and found that it triggered a 
wide range of inflammatory molecules, 
both desired and not. The goal now is 
to design ionizable lipids that activate 
favorable immune pathways without 
overstimulating detrimental ones, says 
Alameh, who co-founded AexeRNA 
Therapeutics with bioengineer Michael 
Buschmann of George Mason University
and others. “Is it very simple? No,”
Alameh says, but it should be possible. 

Before his death in March, 
Buschmann led a team that in 2021 showed 
how the electric charge of LNPs is critical to 
vaccine success. A negative charge makes the 
particle less likely to stay in the muscle and 
lymph nodes of injected mice, where it could 
elicit beneficial immune responses; instead 
it tends to spread widely, raising the risk of 
fevers, chills, and other adverse reactions. 

To make a less negatively charged 
nanoparticle, the researchers tweaked the 
chemistry of the ionizable lipid. When 
formulated into an mRNA vaccine for 
COVID-19, the new LNP carrier prompted 
mice to produce more protective antibodies
than standard delivery systems and had
fewer side effects, according to data posted 
last year in preprint form and now under 
review at Nature Communications. 

Dan Peer, a biochemist at Tel Aviv Uni- 
versity and co-founder of the vaccine de- 
livery startup NeoVac, has also developed 
libraries of new ionizable lipids with atypi- 
cal structures. In unpublished experiments, 
they seem to enable better mRNA vaccines 
with fewer side effects and also extend 
their shelf stability at room temperatures. 

Other improvements could come from
boosting the uptake of LNPs into cells and
then enhancing their ability to break free
of the sacs of cell membrane, known as endosomes,
that carry them inside. The vast
majority of LNPs get trapped in these receptacles
and then destroyed or ejected without
delivering their vaccine payloads, meaning
“there’s a huge amount of RNA that’s not being
used,” says Gaurav Sahay, a bioengineer
at Oregon Health & Science University.


The particulars of nanoparticles

To boost vaccine potency and limit side effects, researchers
are altering each of the four ingredients that make up lipid
nanoparticles. Each particle includes ionizable lipids that bind
mRNA and shift their charge from positive to neutral once in
the body to limit the particle's toxicity. The three other types of
fats contribute to its structure and stability. Helper lipids also
aid particles in fusing with cells, cholesterol helps them escape
from cells’ endosomes, and polyethylene glycol (PEG)-lipids
prevent them from clumping to help prolong their action.

[Figure legend: PEG-lipid, ionizable lipid, helper lipid, cholesterol]

The shape of ionizable lipids affects an 
LNP’s ability to disrupt an endosome, as does 
cholesterol, one of the other fats in LNPs. To- 
gether with Moderna scientists, Sahay and 
colleagues reported in 2020 that using different
forms of cholesterol can enhance rates of
LNP escape from endosomal entrapment. He 
has founded a company called Enterx Bio- 
sciences to commercialize his discoveries. 

Sanofi has begun to evaluate some of its 
customized LNPs head-to-head in human 
trials. In a study launched in 2021, for ex- 
ample, the company assessed two LNP op- 
tions for delivering an mRNA flu shot it is 
developing. According to preliminary data, 
one lipid formulation proved much better 
at kick-starting anti-influenza immunity, 
Frank DeRosa, head of research and bio- 
markers at Sanofi’s mRNA Center of Excel- 
lence, announced at an investor event in 
December 2021. But the same LNP also 
provoked more frequent side effects at 
higher doses. 

Other firms, including BioNTech and 
Arcturus Therapeutics, have begun to 
explore ways to eliminate polyethylene 
glycol, a compound that helps stabi- 
lize LNPs but has also been linked to 
some types of bad vaccine reactions. 
Many more companies, meanwhile, are 
focused on optimizing lipids for de- 
livering mRNA to treat disease rather 
than prevent it. This requires getting 
mRNAs that encode disease-correcting 
proteins to the precise cells and tissues 
where they are needed—not just to 
the liver, where current LNP formula- 
tions tend to end up following infusion. 
“Delivery of LNPs will be key to really 
expanding the reach of mRNA” beyond 
preventative vaccines, says Dominik 
Witzigmann, co-founder and chief ex- 
ecutive of the Cullis-founded startup 
NanoVation Therapeutics. 

The heightened focus on LNP tech- 
nologies, along with the profits reaped 
from the COVID-19 vaccines, has 
brought increased litigation. Alnylam, 
which helped develop the first ap- 
proved medicine delivered in an LNP— 
a gene-silencing drug marketed since 
2018 to treat a rare neurodegenera- 
tive disorder—claims that its founda- 
tional patents cover lipid components 
of the Moderna and Pfizer-BioNTech 
vaccines. And Arbutus BioPharma, yet an- 
other Canadian firm co-founded by Cullis, 
is seeking damages from Moderna for al- 
legedly infringing on a patent that covers 
LNPs comprising certain ratios of lipids. 

But these intellectual property disputes 
are unlikely to have a chilling effect on 
LNP innovation, says Jacob Sherkow, a bio- 
tech patent attorney with the University of 
Illinois College of Law. “There’s too much 
money at play.” 


Elie Dolgin is a science journalist based 
in Somerville, Massachusetts. 

INFECTIOUS DISEASE 


Wrestling with bird flu, Europe 
considers once-taboo vaccines 


To lessen the toll of culling, some countries launch vaccine 
trials despite trade implications and public health risks 


By Erik Stokstad 


In March, Christian Drouin, a French
farmer in the Vendée, discovered that
his chickens were dying of avian influenza.
He had to take drastic action to
cull his flock and prevent the infection
from spreading. Normally, veterinarians
would arrive to gas the birds with
carbon dioxide. But the veterinary teams
were overwhelmed with calls to cull flocks
infected with the virus, now apparently endemic
in Europe.

So Drouin was advised to switch off the 
ventilation fans in his poultry buildings. As 
the temperature rose, most of his 18,000 
birds died of heat stroke over several hours. 
The next day, his neighbors helped him 
bury the carcasses. “After that, I lay down in 
the dark, stunned by what I had done,” he 
told Agence France-Presse. 

Seeking to stamp out the highly patho- 
genic H5N1 strain of avian influenza, 
France and other countries have been cull- 
ing record numbers of poultry—more than 
16 million birds since December 2021 in 
France alone. Last year, the cost there ex- 
ceeded €150 million. Now, faced with the 
desperation of farmers like Drouin, France, 
the Netherlands, and other hard-hit countries
have restarted research into a solution
long considered taboo: vaccinating flocks. 

Ministers in France and other EU coun- 
tries are discussing the idea, and Dutch sci- 
entists have already begun trials of chicken 
vaccines. In southwestern France this week, 
researchers will begin to immunize ducks 
with a newly developed vaccine. And in Oc- 
tober, stakeholders will gather at the World 
Organisation for Animal Health to discuss 
how to lower international barriers to ship- 
ping vaccinated poultry. 

For now, many countries refuse such ship- 
ments because they aren’t confident coun- 
tries with vaccinated birds have controlled 
the virus. Flu experts worry vaccination 
may not completely halt outbreaks, perhaps 
raising the long-term risk that the avian vi- 
rus leaps to humans. And developing and 
administering vaccines will be costly. 

For all these reasons, vaccination is the 
last resort, says avian pathologist Jean-Luc 
Guérin of the National Veterinary School of 
Toulouse. “We use this tool only if we admit 
that we cannot control the infection by clas- 
sical ways.” The United States has not au- 
thorized the use of avian influenza vaccines 
because of the trade implications, hoping
culling will stop its current outbreak. The
U.S. poultry industry is taking a wait-and-see
approach. But in Europe the devastation
wrought by the virus, and the cost and logistics
of culling millions of birds, may be
changing the calculus.


These turkeys in Denmark and millions of other birds
in Europe were killed to stop an avian influenza virus.

In places where the extremely infectious 
new strain of avian influenza has taken 
hold, vaccination “really has the capacity 
to make a huge difference,” says Richard 
Webby, a virologist at St. Jude Children’s 
Research Hospital who studies influenza in 
birds and other animals. In the long term, 
researchers say, living with H5N1 may re- 
quire not just vaccines, but a restructuring 
of dense European poultry operations. 

For 3 decades, ever more strains of avian 
influenza have been emerging in Asia. The 
current H5N1 strain arrived in Europe in 
2021. It was first detected in the United 
States in January and continues to spread 
(Science, 29 April, pp. 441 and 459). 

Some researchers are concerned that vac- 
cinating, if not done carefully, will allow 
H5N1 to persist and continue to mix with 
strains in wild birds, with the risk that it 
might evolve to spread among people. The 
risk for the European Union and United 
States, although low, is probably the highest 
since H5N1 emerged 25 years ago, Webby 
says. “We really don’t want this virus lurk- 
ing around in poultry farms,” adds Adel 
Talaat, an infectious diseases expert at the 
University of Wisconsin, Madison. 

In an encouraging sign, vaccines have 
lessened the impact of recent outbreaks, at 
least in China. In 2017 the country began 
mandatory vaccination of poultry against 
an H7N9 strain that was able to spread to 
people. Vaccination slashed the prevalence 
of the virus in poultry and the number of 
human infections dropped to zero. That ac- 
complishment “could be replicated every- 
where,” says virologist Hualan Chen of the 
Harbin Veterinary Research Institute, who 
developed the vaccines. 

The campaign also paid off for Chinese 
farmers, who resumed producing broiler 
chickens, according to a cost-benefit analy- 
sis published in March in Preventive Vet- 
erinary Medicine. And the United States 
continued to accept Chinese poultry prod- 
ucts, showing trade barriers are not insur- 
mountable. Vaccines also benefit animal 
welfare by reducing the need to cull flocks, 
adds Chen, who “strongly” recommends 
vaccinating poultry against H5 strains. 

Still, the virus strain now in Europe, H5N1, 
can be difficult to control with vaccines 
because it infects many species, including 
ducks, whereas H7N9 is chiefly a problem in 
chickens, Webby says. And economics may 
favor culling for sporadic outbreaks. 

But in hard-hit France, vaccine trials are 
starting. Two vaccines will be tested on 
ducks raised for foie gras. Ducks carrying 
bird flu are the “ultimate reservoir,” Guérin 
says, because they can spread the virus for 
up to 15 days before showing symptoms. In 
the trials, birds will be vaccinated on farms, 
then exposed to the virus in a lab. The goal 
is to reduce the amount of virus circulating 
and so protect other poultry species. 

One vaccine, Volvac Best, is made by 
Boehringer Ingelheim and already used 
in countries outside Europe, including 
Mexico and Egypt, to immunize chickens 
against Newcastle disease and H5N1. Ceva 
created the other vaccine, the first RNA 
vaccine to be tested in poultry, specifically 
for ducks. Results should be available by 
the end of the year, says Gilles Salvat, a vet- 
erinary health expert at the French Agency 
for Food, Environmental and Occupational 
Health & Safety. If the vaccines prove effec- 
tive at lowering viral levels, Salvat hopes 
they might be ready for market by the end 
of 2023. 

After farmers vaccinate flocks, they'll 
need to make sure the virus isn’t circulat- 
ing silently in any birds that were missed 
or didn’t respond fully to a vaccine. They 
will need to swab birds and test for the vi- 
rus, which can spread on boots, clothing, 
tires, and even the wind. Such measures 
will also reduce the risk of spread to hu- 
mans or wild species, says Carol Cardona, a 
veterinarian and avian influenza specialist 
at the University of Minnesota, Twin Cities. 

“Vaccines can help, but are not the golden 
bullet,” says Ron Fouchier, a virologist at 
Erasmus University Medical Center. The 
approach may still require some culling, he 
says, because viruses will continue to evolve 
and may occasionally escape vaccines. 

The current outbreak could be a game 
changer because the virus is spreading in 
many wild bird species. If it becomes en- 
demic in the United States, too, then vacci- 
nation could become necessary, researchers 
say. The U.S. Department of Agriculture is 
modifying existing vaccines and testing new 
ones against the current H5N1 strain. 

In the bigger picture, the Chinese poultry 
sector needs to prevent viruses from spill- 
ing to wild birds, Fouchier says. And Euro- 
pean countries need to restructure to avoid 
having many farms with dense flocks close 
together, researchers say—an even bigger 
challenge than implementing vaccination. 

Cardona says it could take years to op- 
timize and approve vaccines, as well as 
devise a vaccination strategy and reassure 
trade partners. “What are we waiting for?” 
she asks. “We need to get working.” 

COVID-19 


Pandemic delays continue to 
plague polar science 


Backlog of NSF-funded work will allow few new projects 


By Paul Voosen 


In March 2020, staff at McMurdo Station,
tion, the main U.S. research base in 
Antarctica, thought the future was 
bright. Long-planned renovations had 
begun, including the replacement of 
decrepit dorms by glossy new lodges 
capable of housing more than 200 people. 
But then the pandemic struck, shutting 
down most of two summer field seasons at 
McMurdo and other polar research sites in 
Antarctica and Greenland. In some places 
the effects of that shutdown will linger for 
the rest of the decade, the National Science 
Foundation (NSF) announced last week, 
delaying projects and limiting access to 
one of the rarest resources in geoscience: 
time on the ice. 

In Greenland, government entry restrictions
kept most researchers away in the
summer of 2021. Although
NSF kept its high-altitude 
Summit Station running 
year-round, only minimal 
maintenance occurred, says 
Jennifer Mercer, NSF’s Arc- 
tic section head. With nearly 
1 meter of snow falling each 
year on the camp, it will re- 
quire a lot of literal digging 
out. “We have a constant battle maintain- 
ing buildings above grade,” Mercer says. 

After the dig-out this summer, research 
in Greenland will be about back to normal. 
Not so in Antarctica, where “we're satu- 
rated for a while in key logistics areas,” says 
Stephanie Short, NSF’s head of Antarctic 
logistics. No work has been done on the Mc- 
Murdo renovation for the past 2 years, and 
space in the old dorms had to be reserved to 
house possible COVID-19 cases, leaving the 
agency down by more than 200 beds. “To 
get back to full strength,” Short says, “we 
need that lodging building.” 

For now research in Antarctica will 
prioritize ongoing projects that feature 
either heavy international participation— 
such as the International Thwaites Glacier 
Collaboration—or critical annual measure- 
ments, says Michael Jackson, head of Ant- 
arctic earth sciences at NSF. New starts 
will be biased toward projects led by early-career
researchers. But some new projects
will have to be delayed again, he says.
“That’s heartbreaking for us,” he says.

One deferred project is a plan to drill 
into Hercules Dome, an expanse of ice 
400 kilometers from the South Pole. Ice 
cores from the dome might record the last 
time the West Antarctic Ice Sheet collapsed 
in a slightly warmer climate more than 
100,000 years ago, and help predict when 
it might happen next. When NSF agreed 
to fund the project in 2020, research- 
ers thought drilling might begin by 2023. 
Now, 2025 is more likely, says Eric Steig, 
the project’s principal investigator at the 
University of Washington, Seattle. 

Steig says the pandemic hit an enter- 
prise that was already stretched. “NSF is 
always planning on more projects [in Ant- 
arctica] than they are likely to be able to 
support, so even without COVID we always 
run into major delays.” And despite the 
agency’s plan to give priority 
to early-career projects, he 
says, many young research- 
ers may be left in the cold, 
as projects led by senior re- 
searchers also support many 
early-career researchers. 
But there are no simple solutions,
and “I have a lot of
confidence in the NSF pro- 
gram managers,” he says. 

Despite the isolation of the pandemic, 
one NSF project built new bridges between 
U.S. and Greenlandic researchers. It seeks 
to help Greenlandic communities under- 
stand the effect of climate change on lo- 
cal sea levels, which counterintuitively are 
set to fall by as much as several meters by 
century’s end, as land sheds ice weight and 
rebounds and the gravitational tug of the 
massive ice sheet on the surrounding ocean 
ebbs. During the pandemic, Greenlandic 
researchers kept working without the U.S. 
team, says Kirsty Tinto, a Columbia Uni- 
versity geophysicist on the project. They 
sailed on seafloor mapping cruises and 
interviewed community leaders—hunters, 
fishers, city planners—about how they 
use the waterfront. The pandemic showed 
the joint project’s resilience, Tinto says. “I 
don’t like pandemics. I don’t like global de- 
spair.” But, she says, “I do like having my 
expectations confounded.” 

FUNDING 


Odds for winning NSF grants 
improve as competition eases 


Agency reports a 17% drop in number of grant applications 
since 2011, troubling some observers 


By Jeffrey Mervis 


Federal research agencies strive to fund
a healthy percentage of the grant applications
they receive, ensuring that
scientists can pursue their best ideas.
A steadily rising budget is their preferred
method for maintaining a robust
success rate. But a fall in applications
can have the same effect.

A new report from the National Science 
Foundation (NSF) on its merit review sys- 
tem documents how falling demand has 
boosted success rates at the $8.5 billion 
research agency. Released late last month, 
the analysis shows the annual number of 
applications submitted to NSF has dropped 
by 17% over the past decade, falling from 
51,562 in 2011 to 42,723 in 2020. Success 
rates jumped from 22% to 28% during the 
same period, even though the number of 
grants awarded increased by 8%. The rate 
rose even faster—from 19% to 28%—for the 
agency’s standard research grants. 
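
The 17% figure can be checked directly from the two application counts quoted above. The short sketch below does only that arithmetic; the function and variable names are illustrative and are not taken from the NSF report.

```python
# Application counts as quoted in the article (NSF merit review report).
applications_2011 = 51_562
applications_2020 = 42_723


def percent_change(old: float, new: float) -> float:
    """Relative change from old to new, in percent (negative means a drop)."""
    return (new - old) / old * 100.0


drop = percent_change(applications_2011, applications_2020)
print(f"Change in applications, 2011 to 2020: {drop:.1f}%")  # prints -17.1%
```

The result, a decline of about 17.1%, matches the rounded 17% drop the report describes.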

Those trends are most visible in NSF's bio- 
logy directorate, where demand has tumbled 
by 50% over the decade and the chances of 
winning a grant have doubled, from 18% in 
2011 to 36% in 2020. Officials credit the shifts 
to their decisions to eliminate fixed deadlines 
for submitting proposals and make other 
tweaks to the grantsmaking process designed 
to ease the workload on beleaguered pro- 
gram officers. The geology directorate, which 
piloted some of the same changes but never 
adopted them across the board, has seen a 
28% decline in the number of proposals, with 
success rates rising from 31% to 42%. The 
pattern is similar for the engineering direc- 
torate, which dropped deadlines in 2018. 
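The arithmetic behind these figures can be checked with a short sketch. It uses only the application counts and success rates quoted above; the helper names are illustrative, and the "awards held constant" scenario is an assumption for comparison, not something the NSF report states. Because the published rates are rounded, the results are approximate.

```python
# Figures quoted in the article (applications and rounded success rates).
APPS_2011 = 51_562
APPS_2020 = 42_723


def percent_drop(old: float, new: float) -> float:
    """Percentage decline from old to new."""
    return (old - new) / old * 100.0


def rate_if_awards_constant(old_rate_pct: float, decline_pct: float) -> float:
    """Success rate that would result if the number of awards stayed the
    same while applications fell by decline_pct (hypothetical scenario)."""
    return old_rate_pct / (1.0 - decline_pct / 100.0)


application_drop_pct = percent_drop(APPS_2011, APPS_2020)
print(f"Drop in applications: {application_drop_pct:.1f}%")

# Biology: applications fell 50%, so a constant number of awards
# would double an 18% success rate.
print(f"Biology, awards constant: {rate_if_awards_constant(18, 50):.0f}%")

# Agency-wide: with constant awards the rate would land below the
# reported 28%, consistent with the number of awards having risen.
print(f"Agency-wide, awards constant: "
      f"{rate_if_awards_constant(22, application_drop_pct):.1f}%")
```

The comparison separates the two effects the article describes: a smaller pool of applications raises the success rate by itself, and any additional rise reflects more awards being made.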

Higher success rates please researchers 
and administrators. “The ability of faculty to 
get funds for their research is the ultimate 
metric, and success rates tell you that,” says 
Sarah Nusser, a statistician and former vice 
president for research at Iowa State Univer- 
sity. She says she “never paid much attention” 
to how many proposals faculty were submit- 
ting. But some policymakers wonder whether 
fewer cutting-edge research proposals could 
affect the nation’s ability to innovate. 

“We're not pleased that there has been the 
decline,” says Stephen Willard, a biotech ex- 
ecutive who is a member of NSF's presiden- 
tially appointed oversight body, the National 
Science Board. “We're trying to identify the 
reasons, with the goal of turning it around.” 

James Olds, a neuroscientist at George 
Mason University who led the biology direc- 
torate from 2014 to 2018, the period when 
the deadline rules were revised, thinks the 
changes helped improve the quality of pro- 
posals. “Life scientists are always collecting 
more data to test their hypotheses,” he says, 
“so I prefer to see researchers submit a pro- 
posal when it’s ready scientifically, not be- 
cause of the craziness of a 5 p.m. deadline.” 

A senior NSF official, however, isn’t sure the changes in the application process explain the overall drop in proposals. “The merit review report helps identify interesting trends and this is one of them,” says Erika Rissi of NSF’s Office of Integrative Activities, which does the analysis each year. “But the report doesn’t diagnose anything, so we don’t know why.”

The National Science Foundation could speed its granting process with more RAPID awards, often given for studies of natural disasters such as this Alaska landslide.

Higher education officials say many fac- 
tors influence whether a researcher submits 
a grant, including their readiness to par- 
ticipate in a new solicitation or their per- 
ception of the odds of getting funded. The 
COVID-19 pandemic, which caused mas- 
sive disruptions on campus, could also play 
a role. (The report only covers operations 
through September 2020, which spans the 
first 6 months of the pandemic.) 

The trends are not uniform across the 
agency. The computing directorate now 
gets 20% more proposals than a decade 
ago, for example, and a 40% increase in 
its budget has helped it maintain a success 
rate of 24%. Demand in physical sciences 
and math has been flat, and success rates 
have crept up from 27% to 30%. Both direc- 
torates used fixed deadlines for submitting 
proposals during the period. 

NSF's analysis also reveals large differ- 
ences in the trends by gender and race/ 
ethnicity. The number of proposals from 
women has declined by 12% over the decade, 
for example, compared with a 21% drop for 
men. The number of proposals from Black 
and Asian investigators has shrunk by 27% 
and 28%, respectively, a much steeper de- 
cline than for the overall pool. 

The data-rich report includes a sugges- 
tion from the science board for improving 
another crucial metric: the “dwell time” be- 
tween the receipt of a proposal and notify- 
ing the applicant of a funding decision. The 
agency’s goal is to complete action on 75% 
of applications within 6 months, but the re- 
port notes it has fallen short of that target 
for the past 4 years. 

NSF might be able to do better, the board 
said, by funding more proposals that don’t 
require outside vetting. It cited the RAPID 
awards, an NSF-wide program that makes 
quick-turnaround grants of up to $200,000 
to study the impact of a natural disaster or 
a major societal upheaval like the current 
pandemic. NSF quadrupled the number 
of RAPID grants in 2020 after receiving 
$75 million for them in the first massive 
federal COVID-19 relief package. Even so, 
they only made up 2.7% of all NSF research 
proposals that year. 

“We're not saying go easy on them,” Willard 
says. “But this could help with dwell times.” 
CLIMATE SCIENCE


‘Hot’ climate models exaggerate Earth impacts 


U.N. report authors say researchers should avoid suspect models 


By Paul Voosen 


One study suggests Arctic rainfall will
become dominant in the 2060s, de- 
cades earlier than expected. Another 
claims air pollution from forest fires 
in the western United States could tri- 
ple by 2100. A third says a mass ocean 
extinction could arrive in just a few centuries. 
All three studies, published in the past 
year, rely on projections produced by some 
of the world’s next-generation climate mod- 
els. But many of the models have a glaring 
problem: They predict a future that gets too 
hot too fast (Science, 30 July 2021, p. 474). 
Although modelmakers acknowledge the 
problem, researchers who use the model 
projections to gauge the impacts of climate 
change are sometimes unaware. That 
has resulted in a parade of “faster than 
expected” results that threatens to un- 
dermine the credibility of climate sci- 
ence, some researchers fear. 
Scientists need to get much choosier in how they use model results, a group of climate scientists argues in a commentary published last week in Nature. Researchers should no longer [...] climate model projections, says Zeke Hausfather [...] commentary. “We must move away from the naive idea of model democracy.” Instead, he and his colleagues call for a model meritocracy, prioritizing, at times, results from models known to have more realistic warming rates.

[Figure legend: All-model average; High emissions; Deep emission cuts]

Overall, climate models remain incredibly 
successful research tools, and nothing about 
this “too hot” generation invalidates the te- 
nets of climate science, says Kate Marvel, a 
climate scientist at NASA’s Goddard Institute 
for Space Studies and co-author of the com- 
mentary. The greenhouse effect is still warm- 
ing the planet. Ice is melting, seas are rising, 
and droughts are becoming more frequent in 
some areas. But the models are not perfect, 
Marvel says. “They’re not crystal balls.” 

The problem of the too-hot models arose 
in 2019 from the Coupled Model Intercom- 
parison Project (CMIP), which combines the 
results of the world’s models in advance of 
the major reports that come out every 7 or 
8 years from the Intergovernmental Panel on 
Climate Change (IPCC). In previous rounds
of CMIP, most models projected a “climate
sensitivity”—the warming expected when
atmospheric carbon dioxide is doubled over 
preindustrial times—of between 2°C and 
4.5°C. But for the 2019 CMIP6 round, 10 out 
of 55 of the models had sensitivities higher 
than 5°C—a stark departure. The results were 
also at odds with a landmark study that es- 
chewed global modeling results and instead 
relied on paleoclimate and observational re- 
cords to identify Earth’s climate sensitivity. 
It found that the likely value sits somewhere 
between 2.6°C and 3.9°C. 
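As a rough illustration of how such thresholds could be used to sort models, the sketch below labels each model by its climate sensitivity. The 5°C cutoff, the 2.6°C to 3.9°C likely range, and the 10-of-55 count come from the text above; the model names and sensitivity values are hypothetical placeholders, not real CMIP6 output, and this is not the screening method any of the groups mentioned here actually used.

```python
# Thresholds quoted in the article.
LIKELY_RANGE_C = (2.6, 3.9)   # likely climate sensitivity from the landmark study
HOT_THRESHOLD_C = 5.0         # sensitivities above this marked the "hot" models

# HYPOTHETICAL values for illustration only; not real model results.
hypothetical_sensitivities = {
    "model_a": 2.9,
    "model_b": 3.4,
    "model_c": 4.8,
    "model_d": 5.3,
    "model_e": 5.6,
}


def classify(sensitivity_c: float) -> str:
    """Label a model by where its sensitivity falls relative to the thresholds."""
    low, high = LIKELY_RANGE_C
    if sensitivity_c > HOT_THRESHOLD_C:
        return "hot"
    if low <= sensitivity_c <= high:
        return "within likely range"
    return "outside likely range"


labels = {name: classify(value)
          for name, value in hypothetical_sensitivities.items()}
for name, label in labels.items():
    print(f"{name}: {label}")

# Share of CMIP6 models above 5°C, from the counts reported in the article.
hot_share_cmip6 = 10 / 55
print(f"Hot models in CMIP6: {hot_share_cmip6:.0%}")
```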

Researchers have since tracked down the causes of the too-hot models, which include those produced by the National Center for Atmospheric Research, the U.S. Department of Energy, the United Kingdom’s Met Office, and Environment and Climate Change Canada. They often relate to the way models render clouds; one result has been excessive predicted warming in the tropics.

Hot mess
Because of problems rendering clouds, some climate models run hot. Researchers say it’s better to use select models than an average.

IPCC tried to compensate for this problem last year when it published its first working group report, which covers the physical basis of climate change. IPCC rated models on their skill at capturing historical temperatures. Then, it used the best-performing models to produce its official “assessed warming” projections for different fossil fuel emissions scenarios. When it came to studying the future changes to Earth, IPCC reported results from all the models based on degree of warming: 1.5°C, 2°C, 3°C. That kept useful information from the hot models, such as changes in rainfall patterns, even if they reached the warming thresholds too fast.

Although IPCC rose to the challenge, it didn’t do a great job telling everyone about the actual problem, says Hausfather, himself an IPCC co-author. “A large number of our colleagues had no idea that the IPCC did this,” he says. And since then, dozens of published studies have used projections based on the raw average of all CMIP6 models or, worse, the output of individual “hot” models. “It’s not because anybody is acting in bad faith,” Marvel says. “It’s just because there’s no guidance.”

Climate impact researchers need to emulate the steps IPCC took, Hausfather and his co-authors say. First, they should avoid the dubious time-based scenarios and instead emphasize the effects of specific levels of global warming, regardless of the date those levels are reached. They should also use IPCC’s assessed projections for when those warming levels might arise. And for studies where the details of the warming trajectory are important, they can use select models that predict it better, like those produced by NASA and the National Oceanic and Atmospheric Administration.

Claudia Tebaldi, a climate scientist at Lawrence Berkeley National Laboratory, agrees with most of the group’s recommendations. However, she says, they may underestimate policymakers’ desire for time-based information, which in her experience is nearly always requested. And some climate impacts, like sea level rise, depend on the warming rate, not just the absolute amount of warming.

Researchers should think about going 
even further, and examining whether cer- 
tain models have, for example, big regional 
biases, says Reto Knutti, a climate scientist 
at ETH Zurich who has called for “model 
meritocracy” for more than a decade. As 
more city planners and outside scientists 
turn to these projections, they should first 
be sure to consult a climate model expert. 
“Given that these results guide climate ad- 
aptation and investments of billions of dol- 
lars, that seems like an effort worth doing,” 
Knutti says. 
THE BIAS HUNTER


Itiel Dror is determined to reveal the role of bias 
in forensics, even if it sparks outrage 


By Douglas Starr

In February 2021, cognitive psychologist Itiel Dror set off a firestorm in the forensics community. In a paper, he suggested forensic pathologists were more likely to pronounce a child’s death a murder versus an accident if the victim was Black and brought to the hospital by the mother’s boyfriend than if they were white and brought in by the grandmother. It was the latest of Dror’s many experiments suggesting forensic scientists are subconsciously influenced by cognitive biases—biases that can put innocent people in jail.

Dror, a researcher at University College London (UCL), has spent decades using real-world cases and data to show how experts in fields as diverse as hospital care and aviation can reverse themselves when presented with the same evidence in different contexts. But his most public work has involved forensic science, a field reckoning with a history of unscientific methods. In 2009, the National Research Council published a groundbreaking report finding that most forensic sciences—including the analysis of bullets, hair, bite marks, and even fingerprints—are based more on tradition than on quantifiable science. Since then, hundreds of studies and legal cases have revealed flaws in forensic sciences.

Dror’s work forms a connective tissue 
among them. He has shown that most 
problems with forensics do not originate 
with “bad apple” technicians who have 
infiltrated crime labs. Rather they come 
from the same kind of subconscious bias 
that affects everyone’s daily decisions—the 
shortcuts and generalizations our brains 
rely on to process reality. “We don’t actu- 
ally see the environment,” Dror says. “We 
perceive stimuli from the environment 
that our brain represents to us,” shaped by 
feelings and past experience. 

“In the span of a decade, cognitive bias 
went from being almost totally unheard of
in forensics to common knowledge in the 
lab,” Brandon Garrett, a professor at the 
Duke University School of Law, wrote in 
his book Autopsy of a Crime Lab: Exposing 
the Flaws in Forensics. “We can especially 
thank Itiel Dror for helping bring about 
the sea change.” 

Dror now travels the world testifying 
in trials, taking part in commissions, and 
offering training to police departments, 
forensic laboratories, judges, militaries, 
corporations, government agencies, and 
hospitals. National agen- 
cies, forensic labs, and 
police forces have ad- 
opted his approach to 
shielding experts from 
information that could 
bias them. 

“I don’t know anybody
else who’s doing every- 
thing that Itiel is do- 
ing,” says Bridget Mary 
McCormack, chief justice 
of the Michigan Supreme 
Court, who worked with 
Dror on a U.S. Depart- 
ment of Justice task 
force and collaborated 
on studies with him. “His work is monu- 
mentally important to figuring out how we 
can do better. To my mind it’s critical to the 
future of the rule of law.” 

Dror’s previous studies on bias in foren- 
sics caused grumbling, but nothing like 
the reaction to the 2021 paper. This time, 
he used a survey to see whether bias could 
affect decision-making among medical ex- 
aminers. He concluded that nonmedical 
evidence such as the race of the decedent 
or their relation to the caregiver—details 
that most medical examiners routinely 
consider—were actually a source of bias. 

Eighty-five of the country’s most promi- 
nent pathologists demanded its retraction. 
The National Association of Medical Exam- 
iners (NAME) alleged ethical misconduct 
and demanded that Dror’s employer, UCL, 
stop his research. The editor of the Journal 
of Forensic Sciences wrote that he hadn’t 
seen so many arguments in the journal’s
65-year history, or so much anger. After de- 
cades challenging forensic experts, Dror had 
gotten into a fight that threatened his career. 


DROR APPEARS to be a mild-mannered man, with salt-and-pepper hair and wire-rimmed glasses; but that impression disappears the moment he begins talking. He gains momentum like a runaway train, detailing his latest study, making a quick detour to bring up an example, slipping in a funny anecdote, then circling around to put a cap on his original point. He speaks with a mixture of accents and intonations from his upbringing in Israel, his graduate work in the United States, and his professional life in the United Kingdom.

“Fingerprints don’t lie. ... but it’s also true that fingerprints don’t speak. It’s the human examiner who makes the judgment, and humans are fallible.”
Itiel Dror, University College London

Two diamond studs in his left ear hint at 
a nonconformist streak. As the child of ac- 
ademic parents who took frequent sabbati- 
cals, Dror attended five elementary schools 
on three continents. “I’d be the new kid 
who didn’t know the language very well,” 
he says. “I didn’t have time to assimilate or 
conform. It was very difficult, but it gave 
me a lot of independence 
of thought.” 

Dror hated reading 
and the discipline of 
school. But things turned 
around at age 19, after 
he broke his back dur- 
ing paratrooper training 
in the Israeli army—an 
incident he describes as 
a “destructively creative” 
shake-up to his system. 
Confined to a body cast 
for 7 months, he took 
to reading the books he 
had ignored during high 
school. He home-tested, 
and got As in the courses he had previously 
failed. He studied philosophy at Tel Aviv 
University, took a year off to work on a kib- 
butz, and later did a doctorate in psychol- 
ogy at Harvard University, studying mental 
imagery and decision-making. 

One of his projects examined how U.S. 
Air Force pilots use mental imagery to rec- 
ognize enemy jets traveling at high speeds. 
The work caught the attention of David 
Charlton, a prominent British fingerprint 
examiner who had started to have doubts 
about his field. 

“I often wondered if when making fingerprint
comparisons my eyes were the
same from one day to the next,” Charlton
says. “And then I came across this paper 
suggesting that the perception of aircraft 
pilots could change, depending on stresses 
or circumstances. And I wondered if it ap- 
plies to fingerprints as well.” 

He had reason for concern. The United 
Kingdom had been shaken by the scandal 
of Shirley McKie, a Scottish police con- 
stable who was charged with perjury after 
investigators claimed to find her thumb- 
print at a murder scene in 1997. McKie 
was cleared when two American experts 
testified that the thumbprint could not 
have been hers. The Americans had their 
own scandal in 2004, when FBI detained 
an American lawyer, Brandon Mayfield, as 
a suspect in a terrorist bombing of a Ma- 
drid train station. Among 20 near-matches 
in their fingerprint database, agents focused
on Mayfield, who had converted
to Islam and provided legal defense to a 
Portland, Oregon, resident with Taliban 
connections. When Spanish authorities 
found the real bomber, Mayfield sued the 
U.S. government, which agreed to a $2 mil- 
lion settlement. 

Those cases deepened Charlton’s doubts 
about his own objectivity. He contacted Dror, 
who suggested they do some research to- 
gether. They found five fingerprint experts 
who knew about the Mayfield case but had 
not seen the fingerprints. Dror and Charlton 
sent each expert a pair of prints from one of 
the expert’s own previous cases, which they 
had personally verified as “matched,” but told 
them the prints came from the notorious case 
of FBI’s mismatch of Mayfield’s prints with 
the terrorist’s. 

Four of the five experts contradicted 
their previous decision: Three now con- 
cluded the pair was a mismatch, and one 
felt he needed more information. They 
seemed to have been influenced by the pas- 
sage of time and extraneous information. 

“It was so simple and elegant,” Peter 
Neufeld, co-founder of the Innocence Proj- 
ect, says of the study. “And when people in 
the forensic community read it, they got it.” 

In a follow-up study, Dror and Charlton 
gave six experts sets of prints they had 
previously examined along with biasing 
information—that the suspect had either 
confessed or had an alibi. Four of the six 
experts changed their past findings. 

The results turned some of Charlton’s 
colleagues against him. “A lot of people 
wondered if I was trying to destroy the 
profession,” he says. Angry letters poured
in to Fingerprint Whorld, the professional 
journal of which Charlton was editor. The 
chair of the Fingerprint Society wrote 
that any fingerprint examiner who could 
be swayed by images or stories “is so im- 
mature he/she should seek employment 
at Disneyland.” 

Charlton was so upset by these reactions 
that he considered abandoning his career. 
“Don’t worry, this is normal,” he remembers
Dror telling him. “It’s part of the human
condition. Now let’s do more research
and see how we can improve things.” 

Dror looked at other biasing factors in 
fingerprint analysis, some of which were 
shockingly innocuous. When police retrieve 
a print from a crime scene, they consult an 
FBI computer database containing millions 
of fingerprints and receive several possible 
matches, in order of the most likely possi- 
bilities. Dror found that experts were likely 
to pick “matches” near the top of the list 
even after he had scrambled their order, 
perhaps because of the subconscious tendency to overly trust computer technology.

Kerry Robinson was exonerated after serving 18 years for a conviction based in part on contested DNA analysis.

“People would say to me fingerprints 
don’t lie,” Dror says. “And I would say yes, 
but it’s also true that fingerprints don’t 
speak. It’s the human examiner who makes 
the judgment, and humans are fallible.” 

Dror and his colleagues are quick to point 
out that bias does not always equal preju- 
dice, but it can foster injustice. Studies have 
shown, for example, that Black schoolchil- 
dren get punished more readily than white 
children for the same misbehavior, because 
many teachers subconsciously assume 
Black children will continue to misbehave. 
And in forensic science, bias can subcon- 
sciously influence experts to interpret data 
in a way that incriminates a suspect. 

If something as seemingly infallible as 
fingerprints could be biased, what could be 
next? Dror set his sights on DNA. When the 
authors of the National Research Council 
study criticized forensic sciences, they made 
an exception for DNA analysis, a method 
developed in the lab that was statistically 
verifiable and scientifically sound. 

But as DNA analysis has gotten more sen- 
sitive and sophisticated, it has also come 
to rely more on human interpretation. For 
example, when investigators find a mixture
of several people’s DNA at a crime scene, it’s
up to the analyst to tease apart the contrib- 
utors. It’s a complicated and subtle process, 
one that Dror found can be influenced by 
context. Consider the case of Kerry Robin- 
son in Georgia, who was accused in 2002 of 
taking part in a gang rape. The state based 
its case on the plea bargain testimony of 
Tyrone White, whom investigators had identified
as the main perpetrator and who bore
Robinson a grudge. The state’s two DNA experts
found that Robinson’s DNA “could not
be excluded” from the mixture of DNA
found at the crime scene, and the jury 
found him guilty. 

Greg Hampikian, a genetics professor 
then at the Georgia Innocence Project, 
sent DNA data from the case to Dror, who 
shared it with 17 DNA analysts unfamiliar 
with the case. Only one agreed with Geor- 
gia’s analysts; the other 16 either excluded 
Robinson’s DNA or said they could not come 
up with a result. Dror’s conclusion: Even 
DNA analysis, the “gold standard” of foren- 
sic science, was subject to human bias. The 
state did not release Robinson until 2020, 
when Hampikian submitted other exoner- 
ating information. Robinson had already 
served 18 years of his 20-year sentence. 
Over the years Dror and other researchers
have found bias just about everywhere
they’ve looked—in toxicologists, forensic 
anthropologists, arson investigators, and 
others who must make judgments about 
often ambiguous crime scene evidence. Yet 
juries find forensic evidence compelling, 
Dror and others have found. 

Many examiners feel “impervious to bias,” 
says Saul Kassin, a psychologist at John Jay 
College of Criminal Justice, “as if they’re not 
human like the rest of us.” In 2017, Kassin 
and Dror asked more than 400 forensic sci- 
entists from 21 countries about their percep- 
tions of bias. They found that whereas nearly 
three-quarters of the examiners 
saw bias as a general problem, 
just over 52% saw it as a concern 
in their own specialty, and only 
26% felt that bias might affect 
them personally. 

Dror says the best approach to fighting bias is to shield experts from extraneous information, similar to the “blinding” in scientific experiments. He calls the process Linear Sequential Unmasking, in which the analyst only sees the evidence that’s directly relevant to their task. Some authorities have endorsed the approach. The United Kingdom’s Forensic Science Regulator recommends it as “the most powerful means of safeguarding against the introduction of contextual bias.” FBI adopted the process following the Mayfield case: Because humans tend to see similarities between objects viewed side by side, agents now document the features of a crime scene fingerprint on its own before comparing it to a suspect’s prints. Police in the Netherlands began to blind fingerprint examiners to details of a crime investigation that might influence their analysis, such as the condition of the body or the urgency of the case, says John Riemen, the police force’s lead biometrics specialist. The approach ensures “you’re looking at fingerprints, and not at your biases,” he says.

A glossary of bias
Itiel Dror and his collaborators have coined various terms to describe how bias sneaks into forensic analysis—and how experts perceive and react to their biases.

Target-driven bias: Subconsciously working backward from a suspect to crime scene evidence, and thus fitting the evidence to the suspect—akin to shooting an arrow at a target and drawing a bull’s-eye around where it hits
Confirmation bias: Focusing on one suspect and highlighting the evidence that supports their guilt, while ignoring or dismissing evidence to the contrary
Bias cascade: When bias spills from one part of the investigation to another, such as when the same person who collects evidence from a crime scene later does the laboratory analysis and is influenced by the emotional impact of the crime scene
Bias snowball: A kind of echo chamber effect in which bias gets amplified because those who become biased then bias others, and so on
Bias blind spot: The belief that although other experts are subject to bias, you certainly are not
Expert immunity: The belief that being an expert makes a person objective and unaffected by bias
Illusion of control: The belief that when an expert is aware of bias, they can overcome it by a sheer act of will
Bad apples: The belief that bias is a matter of incompetence or bad character
Technological protection: The belief that the use of technology, such as computerized fingerprint matching or artificial intelligence, guards against bias


IT WAS AN ATTEMPT to win medical examiners over to this approach that landed Dror in hot water. In 2019, he got a message from Daniel Atherton, a pathologist at the University of Alabama, Birmingham, who wanted him to look at some data he had collected. Atherton had sent a survey to 713 pathologists across the country positing one of two scenarios in which a toddler with a skull fracture and brain hemorrhage was brought to an emergency room and died shortly thereafter. In one scenario, the child was white and was brought in by the grandmother. In the other, the child was Black and brought in by the mother’s boyfriend. The survey asked participants to decide whether the manner of death was undetermined, accidental, or homicide.

Dror analyzed the results and found that of the 133 people who answered the survey, 32 concluded the death was a homicide. And a disproportionate number of those—23—had received the scenario with the Black child and the boyfriend. Participants reading the “Black condition” were five times more likely to conclude homicide than accident, whereas participants in the “White condition” ruled accident more than twice as frequently as homicide.

“Their decisions were noticeably affected by medically irrelevant contextual information,” Dror, Atherton, and their colleagues wrote in their paper, published in the Journal of Forensic Sciences.

The paper also included a survey of 10 years of Nevada death certificates showing an apparent correlation between Black deaths and findings of homicide versus accident—influenced, perhaps, by cultural biases. “I just wanted to get that information out there to begin a discussion,” Dror says of the study.

He got more of a discussion than he expected. The journal was swamped with angry letters from medical examiners. One derided the study as “rank pseudoscience.” Another, signed by the president of NAME along with 84 other pathologists, excoriated the study as “fatally flawed” and “an abject failure of the peer review process,” and demanded its retraction. (Michael Peat, editor of the journal, declined to retract the article, saying it had been peer reviewed before publication and re-reviewed by a respected biostatistician following the complaints.)

Many pathologists pointed out that the experimental design linked two unrelated variables—the race of the child and their relationship to the caretaker. They were further inflamed by Dror’s labeling the scenarios “Black condition” and “White condition,” when they had reason to suspect that the caretaker, not the race, was the relevant variable. Statistics show a boyfriend of any race is far more likely to harm a child in his care than a grandmother. “To introduce race ... appears to be an effort to label the survey responders, and their colleagues by proxy, as racist,” said the letter from the 85 practitioners. “Had this survey been done with the races reversed ... White cases were more likely to be called homicide and Black cases more likely to be called accident.”

They contended that Dror was using inflammatory language to get headlines. And they noted that other factors could have played a role in the pathologists’ decisions, such as their level of experience, local crime statistics, and office policies, none of which Dror had considered.
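For readers who want to see what the headline survey counts do and do not show, a minimal sketch follows. It uses only the three numbers reported in the article (133 respondents, 32 homicide rulings, 23 of them from the Black-child scenario); the variable names are illustrative. The article does not report how many respondents received each scenario, so per-scenario homicide rates cannot be derived from these counts alone, and the sketch says nothing about the confounding the critics raise.

```python
# Counts reported in the article.
respondents = 133
homicide_total = 32
homicide_black_scenario = 23

# Homicide rulings from the other scenario follow by subtraction.
homicide_white_scenario = homicide_total - homicide_black_scenario

# Share of all homicide rulings that came from each scenario.
share_black = homicide_black_scenario / homicide_total
share_white = homicide_white_scenario / homicide_total

# Share of all respondents who ruled homicide, regardless of scenario.
homicide_share_of_respondents = homicide_total / respondents

print(f"Homicide rulings, white-child scenario: {homicide_white_scenario}")
print(f"Share of homicide rulings from Black-child scenario: {share_black:.0%}")
print(f"Share of respondents ruling homicide: {homicide_share_of_respondents:.0%}")
```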

Stephen Soumerai, an expert in research 
design at Harvard Medical School, agrees 
that linking a known risk factor for homicide 
(caregiver relationship) to a nonwhite race is 
problematic. And the survey of Nevada death 
certificates failed to investigate other possi- 
ble explanations beyond race, he says. “The 
hypothesis is reasonable and important, but 
the research does not adhere to basic prin- 
ciples of research design,” he says. 

Dror admits he would have been wise to 
use neutral terms to designate the two ex- 
perimental groups. But he doesn’t concede 
that the study is flawed. “It is a first study 
to examine and establish that there is bias 
in forensic pathology,” he says. Dror agrees
that statistics do show an unrelated care- 
taker is more likely to harm a child than 
a grandmother. But such generalizations 
should not affect how examiners diagnose 
individual cases. 

Judy Melinek, CEO of PathologyExpert, 
Inc., who practices forensic pathology in
Wellington, New Zealand, agrees. “I’ve seen 
too many cases where innocent caregivers 
were prosecuted for accidental child deaths 
because forensic pathologists made as- 
sumptions based on larger trends.” 

Dror says antibias strategies are especially important to medical examiners because many work hand in hand with police who might influence them. One solution is to have a laboratory’s case manager safeguard details about an investigation and unveil them to a medical examiner only as needed to determine the manner of death—similar to the Linear Sequential Unmasking used by the Dutch police, among others. “It’s all about looking at the right evidence in the right sequence,” Dror says.

That approach represents a “clueless” understanding of how medical examiners work—one that cognitive psychologists have held for years, says William Oliver, a retired professor of pathology at East Carolina University’s Brody School of Medicine and a former board member of NAME. Unlike other forensic examiners, who match patterns from a particular type of evidence, medical examiners must gather all the information about the case that they can to make a correct diagnosis, he says.

They determine both cause of death—the injury or illness that killed a person—and the manner of death, which describes how the death came about. If a dead man is found sitting in his car with the engine idling, the garage door closed, and high levels of carbon monoxide in his blood, the autopsy would likely conclude that the cause of death was carbon monoxide poisoning. But the manner of death would remain “undetermined” unless investigators found signs pointing to suicide, such as a note, recent job loss or divorce, or statements from friends that he had been depressed.

Brandon Mayfield was detained as a bombing suspect based on a flawed FBI fingerprint analysis.


690 13 MAY 2022 + VOL 376 ISSUE 6594 


“Manner is not a scientific determina- 
tion, and it is not meant to be,” Oliver 
says. Aggregate statistics—like the rates at 
which grandmothers and unrelated care- 
takers harm children—are crucial to mak- 
ing that judgment, he says. 

The acrimony around Dror’s paper snow- 
balled. On 19 March 2021, Brian Peterson, 
a member of NAME and chief medical ex- 
aminer for Milwaukee County in Wiscon- 
sin, filed a formal ethics complaint with 
the association against the four patholo- 
gists who collaborated with Dror. Their 
paper would “do incalculable damage to 
our profession,” exposing every medical ex- 
aminer to withering cross-examination at 
trials, he wrote. 

“I was shocked at the reaction,” says Joye
Carter, a forensic pathology consultant and
co-author on the study, who was named in
the complaint. “We’re supposed to be fact
finders, but people got whipped up into
this ridiculous attitude that they were being
persecuted.”

Both the Innocence Project and the Legal 
Defense Fund came to the pathologists’ de- 
fense, and NAME dismissed Peterson’s com- 
plaint in May 2021. (Carter, a prominent 
pathologist who was the first Black chief 
medical examiner in the United States, re- 
signed from NAME. “There’s no way I can 
be part of a group like this,” she says.) 

But Dror faced a separate attack from 
NAME’s leadership. In an 8 March 2021 let- 
ter to UCL, NAME’s then-President James 
Gill and Executive Vice President Mary 
Ann Sens accused him of intentional eth- 
ics violations, including misleading par- 
ticipants by not telling them the study was 
about race and bias. 

The letter triggered a hearing at UCL’s 
ethics board, which Dror says could have 
ended in his dismissal. He argued that dis- 
closing the nature of the study would have 
biased the results, and at one point he be- 
came so emotional that he had to leave the 
room to regain his composure. Ultimately, 
the board found in his favor, ruling that 
“the allegation is mistaken.” 

The question of bias in autopsies rocketed 
to the headlines after Minneapolis police of- 
ficer Derek Chauvin killed George Floyd on 
25 May 2020. During the trial in April 
2021, the local medical examiner for Hen- 
nepin County in Minnesota testified that 
the manner of death was “homicide,” as did 
other pathologists. But an expert hired by 
Chauvin’s defense team, former Maryland 
Chief Medical Examiner David Fowler, tes- 
tified that Floyd had so many underlying 
health challenges that the manner of death 
was “undetermined.” 

Chauvin was found guilty, but Fowler’s 
testimony outraged other pathologists and 
physicians, who saw in his conclusions a 
pro-police bias. More than 400 of them 
signed a petition to Maryland Attorney 
General Brian Frosh demanding an in- 
vestigation into all the death-in-police- 
custody cases during Fowler’s 17 years in 
office. Frosh recruited seven international 
experts to design the study, including Dror. 
And despite all the blowback Dror has received
for trespassing in the field of forensic
pathology, he agreed to participate.

“If my work results even in one per- 
son not getting wrongly convicted, or one 
guilty person not going free, then it’s worth 
all the grief I’ve been getting,” he says. “And 
maybe not just one person. Hopefully this 
is going to change the domain.” 


Douglas Starr is a journalist in the Boston area. 
PERSPECTIVES


CLIMATE CHANGE 


The limits of forest carbon sequestration

Current models may be overestimating the sequestration potential of forests


By Julia K. Green and Trevor F. Keenan


Although nations are grappling with
reducing carbon emissions to minimize
climate change risks, they actually
need to remove carbon dioxide
from the atmosphere if the goal
is to keep warming to less than 1.5°C
(1). Although there has been substantial 
investment in technologies focusing on 
carbon capture, there is also interest in 
natural solutions, such as planting trees 
(2, 3). Trees remove CO₂ from the atmosphere
through photosynthesis, and forests
sequester CO₂ in the form of biomass
and soil carbon. Though it seems logical 
that increased photosynthesis should lead 
to increased growth, multiple reports sug- 
gest that this is often not the case (4, 5). 
The interaction between photosynthesis, 
tree growth, and carbon sequestration 
has large implications for future carbon 
uptake and is a topic that remains widely 
debated. On page 758 of this issue, Cabon 
et al. (6) contribute to this debate by ex- 
amining global photosynthesis estimates 
and tree ring measurements as a measure 
of growth. 

Carbon sequestration in land ecosystems 
has been gradually increasing since the 1970s 
(7). CO₂ is a necessary component for plant
photosynthesis, and rising concentrations
of atmospheric CO₂ from anthropogenic
emissions have led to higher rates of carbon 
uptake by plants, a phenomenon known 
as carbon fertilization (8). Additionally, 
increases in greenhouse gases have length- 
ened the growing season indirectly through 
warming, leading to additional increases
in ecosystem carbon uptake (7, 9, 10).
These increases in response to rising
atmospheric CO₂ concentration have led to
claims that ecosystems are mostly limited by
the amount of atmospheric CO₂. However, a
large body of research has indicated that
increases in atmospheric CO₂ do not
necessarily translate to increased carbon
sequestration because of other strong
limitations on plant growth (11-13). Even if
more carbon is available to vegetation, other
factors are needed to help stoke growth from
photosynthesis. Most notably, nutrient
abundance and water content in the soil, as
well as temperature, can limit growth (see the
figure). This complication, best characterized
as a dichotomy of whether forest carbon
sequestration is carbon source limited (limited
by how much carbon the plant can
photosynthesize) or sink limited (limited by
how much the plant can grow because of all
environmental factors), has large implications
for how much carbon forests can sequester.

Cabon et al. highlight the importance of considering processes other than photosynthesis to estimate how much carbon trees can sequester.

To understand the restrictions on plant 
growth related to source and sink limita- 
tions, Cabon et al. used gross primary 
production (GPP) estimates—which de- 
note the amount of carbon taken up by 
plants during photosynthesis—and com- 
bined them with tree ring data from the 
International Tree-Ring Data Bank to per- 
form a cross-biome correlation analysis. 
They compiled the most comprehensive 
tree ring dataset to date and identified 
source-sink relationships across the globe. 
They demonstrated that there is a strong 
decoupling between vegetation photosyn- 
thesis and the radial growth of a tree trunk, 
which shows substantial variation based 
on vegetation type (e.g., angiosperm versus 
gymnosperm), ecosystem traits (e.g., forest 
age and nutrient availability), and climate 
(e.g., more decoupling in cooler, more arid 
climates). These results emphasize that the 
relationship between photosynthesis and 
growth is not as simplistic as the current 
representation in models, which typically 
depict vegetation carbon sequestration as 
being closely related to GPP (14).
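
To make the idea of a photosynthesis-growth correlation concrete, the following is a minimal Python sketch. It is not the analysis pipeline of Cabon et al.; it only computes a Pearson correlation between annual GPP and a ring-width index for one hypothetical site. The function name `pearson_r`, the variable names, and every numeric value are illustrative assumptions.

```python
from math import sqrt


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for a constant series")
    return cov / sqrt(var_x * var_y)


# Made-up illustrative values for one hypothetical site (not real data).
annual_gpp = [1200.0, 1250.0, 1180.0, 1300.0, 1270.0]  # g C m^-2 yr^-1
ring_width_index = [1.02, 0.95, 1.01, 0.98, 1.04]      # dimensionless

r = pearson_r(annual_gpp, ring_width_index)
print(f"GPP vs. ring-width correlation: r = {r:.2f}")
```

Under these assumptions, a value of r close to 1 would suggest that radial growth tracks photosynthesis, whereas a value near 0 would be consistent with the decoupling described above.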

Source and sink limitations on plant growth
Plant growth can be restricted by source and/or sink limitations. Although source and sink limitations share several controlling factors, the time scales on which they operate and their sensitivity to these factors often differ.

Source limitation
Photosynthesis provides the carbon source for plant growth. It is primarily limited by sunlight, CO₂, water, and temperature.

Sink limitation
Plant growth is also limited by factors such as the availability of nonstructural carbohydrates (NSCs), nutrients, and water and other environmental factors.

The results reported by Cabon et al. have
implications for using natural ecosystems
to sequester carbon and for the success of
natural climate solutions, such as planting
trees, in combating climate change. If
less carbon than predicted will be stored
in biomass in the coming years, the utility
of this ecosystem service is greatly
diminished. Additionally, the results call for
caution when interpreting findings that
generalize about carbon storage solely on
the basis of GPP estimates. For
example, a GPP increase in the springtime
from plants producing their leaves earlier
may not translate to an increased overall
growth by the end of the year.

Nevertheless, the use of GPP estimates
and tree ring data to understand the
degree to which growth and carbon
sequestration by trees are inherently sink limited
does have some constraints. For example,
related to the use of GPP data, a portion of
the CO₂ absorbed during photosynthesis is
released back into the atmosphere through
plant respiration. Therefore, the amount
of carbon available to plants for growth
is actually not GPP, but the carbon gains
during GPP minus the autotrophic
respiration losses, i.e., the net primary
production (NPP). Many studies examining the
relationship between plant carbon uptake
and growth have used NPP data, which are
more directly linked to vegetation growth
(15) and have a stronger relationship with
biomass. GPP is therefore not necessarily a
reliable proxy for the carbon source
available for growth. For this reason, studies
that measured NPP at some of the
same sites as Cabon et al. have
reported a stronger correlation
between carbon source activity and
tree ring widths, without a role of
sink limitation (15). Also, related
to the use of tree ring data to
infer tree growth, plants allocate
carbon not only to woody
aboveground biomass, but also to their
root systems, their leaves, and
pools of nonstructural carbohydrates
that are stored throughout
the plant. Plants alter their carbon
allocation depending on
environmental changes, forest age,
and the time of year. Therefore,
there are uncertainties when
one infers the amount of carbon
sequestered based solely on
tree ring measurements because
woody growth is not synonymous
with total carbon sequestered.
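
The GPP-versus-NPP distinction above comes down to simple arithmetic, NPP = GPP minus autotrophic respiration. The short Python sketch below illustrates it under stated assumptions: the function name `net_primary_production`, the site names, and all flux values are hypothetical and chosen only to show that two sites with identical GPP can differ in the carbon actually available for growth.

```python
def net_primary_production(gpp, autotrophic_respiration):
    """Return NPP = GPP - Ra, with both fluxes in the same units."""
    if gpp < 0 or autotrophic_respiration < 0:
        raise ValueError("fluxes must be non-negative")
    return gpp - autotrophic_respiration


# Two hypothetical sites with the same GPP but different respiration losses
# (illustrative numbers in g C m^-2 yr^-1, not measurements).
sites = {
    "site_a": {"gpp": 1200.0, "ra": 500.0},
    "site_b": {"gpp": 1200.0, "ra": 700.0},
}

npp_by_site = {
    name: net_primary_production(fluxes["gpp"], fluxes["ra"])
    for name, fluxes in sites.items()
}

for name, npp in npp_by_site.items():
    print(f"{name}: GPP={sites[name]['gpp']:.0f}, NPP={npp:.0f}")
```

In this toy case a GPP-only comparison would treat the two sites as equivalent, although one has less carbon left for growth after respiration.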
Despite these limitations, Cabon 
et al. show that carbon projections
using the current modeling
framework may be overestimating
the carbon sequestration in for- 
ests. Their findings highlight the 
need for an in-depth assessment 
of the degree to which vegetation mod- 
els effectively represent growth dynamics 
and the degree of carbon sink and source 
limitations. To do so will require a better 
understanding of these limitations, which 
should be possible with new methodolo- 
gies of measuring autotrophic respiration 
and NPP directly at flux tower sites, as well 
as an increase in the number of collocated 
measurements related to carbon allocation 
and sequestration.
AGING


Immune cells impede repair of old neurons 


Interfering with age-related neuroimmune interactions promotes nerve regeneration 


By Nicolas Gaudenzio and Roland S. Liblau


The regenerative capacity of older
people is reduced, resulting in decreased
tissue function and resilience.
Accordingly, the regeneration of the
sciatic nerve after injury has been reported
to be less efficient and slower
in older people (1). One of the hallmarks of
aging is altered intercellular communication, 
which is often accompanied by increased 
density of immune cells within tissues and 
excessive release of proinflammatory me- 
diators, called inflammaging (2, 3). In this 
context, the immune system disturbs tissue 
homeostasis and impedes functional recov- 
ery. However, the precise mechanisms un- 
derlying this pathophysiological process are 
largely elusive, which is a barrier to rational 
treatment design. On page 715 of this issue, 
Zhou et al. (4) describe a mechanism by 
which aged sensory neurons release the che- 
moattractive protein C-X-C motif chemokine 
ligand 13 (CXCL13). Upon sciatic nerve injury 
in aged, but not young, mice, this results in 
the recruitment of CD8⁺ T cells that prevent
axonal regeneration. 

T cells are endowed with both antigen 
specificity and memory capacities, and neu- 
ron-T cell interactions have been largely doc- 
umented during infectious, degenerative, and 
autoimmune diseases of the nervous system 
(5-7). Neurons can play a central role in this 
process by releasing inflammatory mediators 
and recruiting immune cells to the site of 
tissue damage, thereby contributing to usu- 
ally beneficial and reparative inflammatory 
processes. The development of a local inflam- 
matory reaction needs to be finely regulated 
to avoid the destruction of postmitotic cells 
such as neurons. However, the nervous sys- 
tem (and many other tissues) is gradually in- 
filtrated by antigen-experienced T cells upon 
aging. This progression is well documented 
by Zhou et al. in the peripheral nervous system
and by recent reports in the brain and
spinal cord in both mice and humans (8-1). 
The increased accessibility of aged T cells to 
distant tissues likely represents an important 
contributing factor (2, 8). Moreover, repeated 
tissue insults and infections, which accumulate
with age, may each leave an additional
local population of T cells that can rapidly
fight the previously encountered intruder.
Of note, the increased density of CD8⁺ T cells
with cytotoxic potential may play a beneficial 
role through the elimination of senescent 
cells. The proportion of senescent cells in- 
creases in tissues with age; they undergo a 
stable cell-cycle arrest and secrete proinflam- 
matory and profibrotic factors and thereby 
interfere with organ function and contribute 
to age-related diseases (12). 


Age-associated nerve injury responses
In young mice, nerve injury usually triggers a regenerative program, which leads to nerve repair. In old mice, there is a high density of CD8⁺ T cells, and injury causes the release of CXCL13 by sensory neurons. This recruits CXCR5⁺ CD8⁺ T cells, which engage MHC-I-antigen complexes on the cell body of injured sensory neurons, resulting in defective axonal repair.

Phosphorylation of AKT and S6 in spinal ganglions promotes nerve repair.
CXCR5⁺ CD8⁺ T cell engagement causes caspase 3-dependent inhibition of AKT and S6.

CXCL13, C-X-C motif chemokine ligand 13; CXCR5, C-X-C motif chemokine receptor 5; MHC-I, major histocompatibility complex class I; P, phosphorylation; TCR, T cell receptor.

However, recent reports suggest that the
protection afforded by tissue-resident CD8⁺ T
cells can be hijacked to favor age-associated
miscalibrated responses or repair decline.
Consistent with this, age-associated brain CD8⁺ T
cells exacerbate acute ischemic injury and
neurodegeneration following traumatic
brain injury (8). Increased numbers of
brain-infiltrating CD8⁺ T cells have also been
documented in neurodegenerative diseases, such
as Alzheimer’s disease, although their precise
role is unknown (13). Zhou et al. unveil a
previously unrecognized molecular communication
between sensory neurons and immune
cells that is inherent to the aged nervous
system, and which is triggered by an aberrant
release of neuron-derived CXCL13, as a
result of increased expression of the
phosphorylated form of the transcription factor
nuclear factor κB2 (NF-κB2). Upon sciatic
nerve injury in aged, but not young, mice,
the release of CXCL13 promotes the recruitment
of CD8⁺ T cells expressing CXCR5 (the
receptor for CXCL13) to the spinal ganglion
where the cell bodies of sensory neurons
are located. These recruited CD8⁺ T cells
interact physically with sensory neuron cell
bodies and prevent axonal regeneration,
through a caspase 3-dependent reduction
in phosphorylated AKT and S6 ribosomal
protein. Consequently, nerve repair is defective
in old animals (see the figure).

Albeit of different focus, a recent study 
revealed the accumulation of antigen-experienced
CD8⁺ T cells that produce the cytokine
interferon-γ in old mouse brains that can
alter the proliferation of neural stem cells in 
the subventricular zone (9). Such cytokine- 
mediated effects could erode regenerative 
capacities and thereby contribute to brain 
aging, possibly reflecting another facet of the 
process identified by Zhou et al.

During aging, transmigration of T cells 
across tissues increases, in part owing to 
greater permeability of blood vessels (14), 
which likely contributes to their accumula- 
tion in tissues over time. Intriguingly, senes- 
cent T cells have been reported to propagate 
certain features of premature aging throughout
the body (15). Zhou et al. found that CD8⁺
T cells infiltrating the spinal ganglion (either
preexisting or attracted there upon injury) of 
old mice directly interact with neurons in an 
antigen-dependent manner, then release per- 
forin and granzyme B that trigger neuronal 
caspase-3 activation without inducing cell 
death. The identity of the antigen(s) involved 
in this deleterious neuron-T cell cross-talk is 
unknown. 

Zhou et al. uncover a cellular and molecu- 
lar mechanism that underpins regenerative 
decline in the nervous system of old mice. 
Although it is unknown if this mechanism 
occurs in humans or is specific to the spinal 
ganglion, the authors investigated whether 
blocking this pathway might offer a therapeu- 
tic opportunity to promote sciatic nerve in- 
jury repair. They showed that neutralization 
of CXCL13 through administration of monoclonal
antibodies prevented the recruitment of
CXCR5-expressing CD8⁺ T cells to the spinal
ganglion, which promoted axonal regeneration
and functional recovery following sciatic
nerve injury in aged mice. In an elegant set 
of complementary experiments, Zhou et al. 
conclusively demonstrate that the specific 
targeting of CD8⁺ T cells expressing CXCR5
phenocopies the beneficial effect of CXCL13 
neutralization. However, CXCR5 is expressed 
at the cell surface of numerous cell types,
including B cells and a subset of helper CD4⁺
T cells, guiding them to lymphoid follicles 
where they can engage in productive bidirec- 
tional interactions. Moreover, it is important 
to keep in mind that CXCR5-expressing
lymphocytes, including CD8⁺ T cells, are key to
mounting an appropriate immune response 
against viruses and likely against cancer cells. 
Therefore, in a future clinical setting, patients 
systemically and/or chronically treated with 
CXCL13-CXCR5 antagonists may be exposed 
to a higher frequency of (re)infections and/or 
neoplasia. Determining the best timing and 
route of administration of such treatment 
will also be key to opening a safe path for fu- 
ture therapeutic intervention. 
Mapping cell types across human tissues

Single-cell analyses reveal tissue-agnostic features and tissue-specific cell states


By Zedao Liu and Zemin Zhang 


Our understanding of how cells form
distinct tissues and organs and interact
with each other is limited. Recent
single-cell RNA sequencing (scRNA-seq)
analyses have described the landscapes
of individual cell types, along
with their abundance and interactions, in
homeostasis and diseased conditions (1-5),
but these studies are often limited to a single 
organ. A systematic comparison of cell types 
across different tissues is needed to under- 
stand shared and variable transcriptional 
features and how these specializations are 
important for organ function. On pages 711, 
712, and 713 of this issue, The Tabula Sapi- 
ens Consortium (6), Eraslan et al. (7), and 
Dominguez Conde et al. (8), respectively, as
well as Suo et al. (9), report pan-tissue sin- 
gle-cell transcriptome atlases covering more 
than a million cells, including 500 cell types, 
across more than 30 human tissues from 68 
donors. These four studies apply rigorous 
ontologies to consistently annotate and com- 
pare single cells between organs. 

The interrogation of these large datasets 
reveals tissue-agnostic and tissue-specific cell 
features, identifies rare cell types, and pro- 
vides insights into cell states that are likely 
to underlie disease pathogenesis (see the fig- 
ure). The Tabula Sapiens Consortium created 
a human reference atlas across 24 different 
tissues and organs using scRNA-seq, leading 
to the characterization of more than 400 cell 
types spanning epithelial, endothelial, stro- 
mal, and immune cell compartments. Eraslan 
et al. took a complementary approach by 
applying single-nucleus RNA sequencing 
(snRNA-seq) to eight human tissue types 
and profiled neuronal cells, muscle cells, and 
adipocytes, which are hard to dissociate and 
capture using scRNA-seq. These cross-tissue 
approaches recapitulated conserved cell-type 
features and revealed cell-state adaptations 
to distinct tissue environments. 

For example, the Tabula Sapiens
Consortium discovered that endothelial cells
from lung, heart, uterus, liver, pancreas, fat,
and muscle exhibit the most distinct
transcriptional signatures, suggesting highly
specialized functions, whereas endothelial 
cells from the thymus, vasculature, pros- 
tate, and eye resemble one another. The 
pan-tissue approach led to the discovery of 
SLC14A1 (solute carrier family 14 member 1) 
as a marker for heart endothelial cells, likely 
reflecting specialized metabolism in cardiac 
blood vessels. Eraslan et al. also found rare 
cell types, such as neuroendocrine cells in the 
prostate and enteric neurons in the esopha- 
gus. Additionally, the corroborative use of 
both high-throughput 10X and full-length 
SMART-seq2 single-cell transcriptome data 
allowed the quantification of splicing isoform 
usage at the single-cell level, thereby reveal- 
ing differential exon usage patterns for genes, 
including MYL6 (myosin light chain 6) and 
CD47, in different cell-type compartments. 

In addition to tissue-resident immobile cell 
types, the immune system is central to execut- 
ing host defense and orchestrating tissue ho- 
meostasis. However, the current understand- 
ing of the components of the human immune 
system is incomplete, in part because of the 
multiple tissues involved in immune cell de- 
velopment and trafficking. Comparing im- 
mune cells from different organs could reveal 
shared and tissue-specific immune cell states 
that reflect adaptations to different local tis- 
sue niches. For example, analysis of myeloid 
cells across organs by Eraslan et al. revealed a 
conserved macrophage differentiation trajec- 
tory across tissues, with common monocyte 
precursors differentiating into macrophages 
that express class II human leukocyte antigen
(HLAII^high) in tissue immunity and macrophages
that express lymphatic vessel endothelial
hyaluronan receptor 1 (LYVE1^high) for
blood-vessel integrity support and inhibition 
of immune cell infiltration. 

Dominguez Conde et al. and Suo et al. pro- 
filed human adult and developing immune 
cells with scRNA-seq. The adult immune cell 
atlas from Dominguez Conde et al. provides a
detailed phenotyping of innate and adaptive 
immune cells across lymphoid and nonlym- 
phoid organs. Tissue-specific immune cell 
subsets emerged, with variable chemokine 
and tissue-residency features reflecting their 
adaptation to organ microenvironments. 
Memory T cells are antigen-experienced T
cells, and three subsets of memory CD8⁺ T
cells were found in the adult immune cell
atlas, with effector memory/effector memory
recently activated T (T_EM/EMRA) cells prevalent
in blood-rich sites, resident memory T (T_RM)
cells enriched in intestinal sites, and resident
memory/effector memory (T_RM/EM) T cells
predominant in spleen and bone marrow; each
subset exhibited distinct chemokine receptor
expression, which might drive their tissue
localization.

Pan-tissue single-cell atlases
Four pan-tissue atlases were constructed using single-cell and single-nucleus RNA sequencing (scRNA-seq and snRNA-seq) of >1 million cells, covering 500 cell types across >30 human tissues. Cross-tissue comparison of cell types and their transcriptional features revealed rare cell types, tissue-agnostic features, and tissue-specific cell states. Disease-associated cell types were also discovered.

Panel labels: scRNA-seq and snRNA-seq of >30 adult and embryo tissue types; monocyte; enteric neuron; disease genes and GWAS loci are mapped to cell types across different tissues.

CCR9, C-C chemokine receptor type 9; CRTAM, cytotoxic and regulatory T cell molecule; CX3CR1, CX3C chemokine receptor 1; GWAS, genome-wide association study; HLAII, class II human leukocyte antigen; LYVE1, lymphatic vessel endothelial hyaluronan receptor 1; T_EM/EMRA, effector memory/effector memory recently activated T cell; T_RM, resident memory T cell.

The prenatal immune cell atlas from Suo 
et al. compares cell states across gestation 
stages and reveals conserved and tissue-spe- 
cific trajectories of immune cell differentia- 
tion. Monocytes develop in the bone marrow 
and can disseminate into peripheral tissue. 
In tissue-enrichment analyses of monocyte
subsets, a C-X-C chemokine receptor type 4
(CXCR4^high)-to-CCR2^high transition was
discovered to accompany monocyte egress from
the bone marrow to the peripheral tissues
where CCR2^high monocytes mature further
into an interleukin-1β-expressing (IL1B⁺)
state. Furthermore, systematic comparisons
of immune cell progenitors revealed a high 
abundance of B cell progenitors across al- 
most all prenatal organs, in contrast to the 
thymus-restricted distribution of T cell pro- 
genitors. This finding builds on the previous 
notion that hematopoiesis is limited to cer- 
tain tissues such as the liver, bone marrow, 
and yolk sac and requires further research. 

These cross-tissue atlases could also 
help reveal cell types that underlie disease
pathogenesis. Although numerous disease-associated
genes have been identified, the
cellular context and tissue environment in 
which these genes function are often opaque. 
Eraslan et al. analyzed muscle diseases as ex- 
amples of monogenic disorders in which mu- 
tations occur in myocyte-expressed genes. By 
relating these genes to cardiac, skeletal, and 
smooth muscle cells across a wide range of 
tissue types, the authors identified finer myo- 
cyte subsets that malfunction. For example, 
myocytes localized at the neuromuscular 
junction, but not those in other locations, are 
enriched with the expression of genes related 
to a type of neuromuscular transmission dis- 
ease. Additionally, ligand-receptor analyses 
between disease-associated myocytes and 
other cell types across tissues highlighted 
interrupted cellular interactions as a mecha- 
nism of muscular disease pathogenesis. For 
instance, mutations in the receptor tyrosine 
protein kinase ERBB3 may disrupt ligand- 
receptor signaling between myocytes and 
Schwann cells (which produce the myelin 
sheath around peripheral neurons) in a type 
of lethal muscular dysfunction. 

For complex diseases, loci that were iden- 
tified in genome-wide association studies 
(GWASs) were enriched in specific cell types 
involved in the tissue of disease manifestation 
and, intriguingly, were also enriched in simi- 
lar cell types residing in tissues unaffected by 
the disease. This is exemplified by the finding 
that coronary heart disease and heart rate 
loci were enriched in pericytes from breast, 
esophagus, and skeletal muscle in addition to 


the heart. These findings warrant further in- 
vestigation of tissue microenvironments and 
how the overall cellular interactions combine 
to induce disease pathogenesis. 

These pan-tissue human cell atlases form 
important reference datasets for understand- 
ing and predicting the side effects and safety 
issues of new medicines. Despite the increas- 
ingly targeted nature of new therapeutics, 
these agents are typically administered sys- 
temically, raising concerns about off-target 
effects (10). For example, CD19⁺ neuronal
cells in the brain are associated with brain 
toxicities of CD19-targeted chimeric antigen 
receptor (CAR)-T cell therapies (11). Ideally,
the on-target, off-tissue side effects can be 
predicted and monitored, instead of being 
discovered in the clinic. Thus, a pan-tissue 
query of the single-cell expression of a thera- 
peutic target would empower toxicity predic- 
tions before the first-in-human experiment. 
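
As a rough illustration of what such a pan-tissue query could look like, here is a minimal Python sketch. It does not use any of the published atlases or their APIs; the record layout, the gene name `TARGET_A`, the function `off_tissue_hits`, the threshold, and all expression values are hypothetical placeholders.

```python
# Hypothetical atlas records: (tissue, cell_type, gene, mean_expression).
# All values are invented for illustration only.
ATLAS = [
    ("blood", "B cell", "TARGET_A", 8.5),
    ("brain", "cell type X", "TARGET_A", 2.1),
    ("lung", "endothelial cell", "TARGET_A", 0.0),
    ("brain", "cell type X", "TARGET_B", 0.3),
]


def off_tissue_hits(records, gene, intended_tissue, threshold=1.0):
    """List cell types outside the intended tissue that express the gene.

    Returns (tissue, cell_type, expression) tuples with expression at or
    above the threshold, sorted from highest to lowest expression.
    """
    hits = [
        (tissue, cell_type, expression)
        for tissue, cell_type, record_gene, expression in records
        if record_gene == gene
        and tissue != intended_tissue
        and expression >= threshold
    ]
    return sorted(hits, key=lambda hit: hit[2], reverse=True)


for tissue, cell_type, expression in off_tissue_hits(ATLAS, "TARGET_A", "blood"):
    print(f"possible on-target, off-tissue expression: {tissue} / {cell_type} ({expression})")
```

A real query would run against a curated atlas with normalized expression values; the point here is only that flagging expression of a target outside its intended tissue is a simple filter once cell types are annotated consistently across organs.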

Collectively, these pan-tissue studies bring 
us closer to building a comprehensive hu- 
man single-cell atlas. Future analyses may 
include larger and diverse cohorts. It is also 
crucial to apply this approach in the setting 
of disease, such as cancer. Understanding 
both shared and tissue-specific features of 
tumorigenesis is key to the development of 
histology-agnostic and cancer type-specific 
therapeutics. For example, immunotherapies 
using programmed cell death protein 1 (PD1) 
antibodies have shown efficacy for multiple 
cancer types. Recently, pan-cancer single-cell 
analyses of tumor immune microenviron- 
ments have been reported for myeloid cells 
(12) and T cells (13) and should be broadened
to include more comprehensive cell types 
and patient cohorts. Such initiatives will 
certainly spur a wave of technical advance- 
ments and biological insights. Ultimately, fu- 
ture understanding of human biology should 
encompass individual genes, cell types, and 
phenotypes in an integrated manner. 
MICROBIOLOGY

Recording bacterial responses to changes in the gut environment

A CRISPR-based tool reveals intestinal microbiota gene expression through time


By Liron Zahavi and Eran Segal


The bacterial population in the human
gut is tightly linked with host health 
and has been implicated in a wide ar- 
ray of diseases, such as atherosclerosis, 
diabetes, and cancer (1, 2). Existing
treatments that are intended to shape 
the microbiota to improve health—prebiotics, 
probiotics, and fecal microbiota transplanta- 
tions (FMTs)—have limited success in obtain- 
ing the desired microbial composition and in 
maintaining it over time (3, 4). This is largely 
because bacteria in the gut are affected by 
other bacteria, human physiology, diet, medi- 
cations, and more, and the relationships in 
this complex system are yet to be deciphered. 
Understanding the bacterial response and 
adaptation to these variables is essential for 
creating more effective microbiota-based 
interventions. On page 714 of this issue, 
Schmidt et al. (5) present a new tool that 
“documents” bacterial gene expression in 
mice and can clarify the bacterial response to 
perturbations in the intestinal environment. 
Most studies that investigate the effect of 
different factors on the gut microbiome are 
based on DNA profiling of fecal samples. 
Such studies compare the DNA content of 
microbiomes and associate differences with 
environmental perturbations. This approach
reveals changes in species composition or in 
bacterial genomics, reflecting the microbi- 
ome response on ecological and evolution- 
ary time scales, respectively. However, the 
most immediate bacterial response happens 
at a physiological time scale, which is not re- 
flected in DNA (6). One immediate means of 
adaptation to environmental changes is gene 
expression. Thus, RNA-based studies fill an 
important gap for understanding bacterial 
adaptation to host variables. 

Schmidt et al. introduce a method for 
studying microbial transcription. They dem- 
onstrate this method by colonizing mice 
with an engineered Escherichia coli strain, 
inserting the Record-seq system, which le- 
verages the recording nature of CRISPR-Cas 
(7). Within the Record-seq-engineered E. coli 
cells, transcripts are reverse transcribed to 
DNA and stored as spacers in a CRISPR array. 
As transcriptional records are stored as DNA 
sequences in the CRISPR array, bacterial 
gene expression history can then be sampled 
by DNA sequencing of host fecal samples 
(see the figure). They use this recording tool 
to investigate mechanisms of E. coli adapta- 
tion to varying host diets, cocolonization with 
another species, bacterial gene deletion, and 
host inflammation. 

Recording microbial responses in vivo
Schmidt et al. engineered Escherichia coli cells to record their gene
expression in the mouse intestine. Within the engineered cells, the
Record-seq system incorporates reverse-transcribed transcripts as new
spacers in a barcoded CRISPR array on plasmids. Sequencing mouse fecal
samples revealed the bacterial response to ongoing and past exposures,
such as diet changes and intestinal inflammation.

They uncovered, for example, multiple
metabolic genes whose expression was increased
when the mice were fed a less diverse
diet. They hypothesized that these genes allow
the bacteria to feed on host mucosal sugars
and predicted that a bacterium lacking
two of these genes will have a competitive 
disadvantage when the host is fed with a re- 
stricted diet, which they validate experimen- 
tally. This shows the potential of this method 
to highlight genes that allow bacteria to adapt 
to specific host conditions. These insights 
could be used to design an intervention that 
is based on bacteria that fit the host diet or 
to adjust host diet to allow certain bacteria 
to thrive. Similarly, they demonstrate the use 
of Record-seq to analyze the effect of intro- 
ducing another species on E. coli gene expres- 
sion and deduce that the bacteria shift their 
metabolism to exploit new niches created by 
the other species. Such experiments can be 
informative about bacteria-bacteria interac- 
tions in a mechanistic resolution that species 
co-occurrence measurements cannot. 

The ability to look at microbial gene 
expression is not new. RNA-sequencing 
(RNA-seq)-based metatranscriptomics is a 
common tool with which to investigate the 
microbiome (8). There are, however, consid- 
erable advantages to Record-seq over RNA- 
seq. Although RNA-seq shows the immediate 
response of gut bacteria to its environment, 
it only reflects a snapshot of the microbiome 
at the moment of the sample. Because of the 
short lifetime of mRNA molecules, samples 
need to be taken at the exact moment of 
the response to investigate transient states. 
Record-seq, however, is distinctive in that it 
archives transcripts in the engineered E. coli 
DNA over time and thus describes a series 
of states. When Schmidt et al. changed the 
diets fed to mice, Record-seq identified signs 
of the previous diet after more than a week, 
whereas the signal was rapidly lost with 
RNA-seq analysis of fecal samples. 
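
A toy model makes the snapshot-versus-archive distinction concrete. The sketch below is illustrative only and is not the analysis pipeline of Schmidt et al.: the `ToyRecorder` class, its capacity, and the gene names are hypothetical, and storing one spacer per expressed gene per time step is a deliberate simplification.

```python
from collections import Counter, deque


class ToyRecorder:
    """Toy stand-in for a CRISPR array: keeps the most recent spacers."""

    def __init__(self, capacity=50):
        # Hypothetical capacity, chosen only so the example fits in memory.
        self.spacers = deque(maxlen=capacity)

    def record(self, expressed_genes):
        # Simplification: store one spacer per currently expressed gene.
        self.spacers.extend(expressed_genes)

    def history(self):
        # Sequencing the array returns counts over the whole recorded window.
        return Counter(self.spacers)


def snapshot(expressed_genes):
    # RNA-seq analogue: only what is expressed at the sampling moment.
    return Counter(expressed_genes)


# Five time steps on diet A, then three on diet B (hypothetical gene names).
timeline = [["geneA"]] * 5 + [["geneB"]] * 3
recorder = ToyRecorder()
for expressed in timeline:
    recorder.record(expressed)

archive = recorder.history()   # still holds geneA from the earlier diet
now = snapshot(timeline[-1])   # holds geneB only
```

In this toy run, the archive retains the signature of the earlier diet after the switch, whereas the snapshot sees only the current state.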

In a different experiment, Schmidt et al. 
demonstrate another advantage of their 
method: the ability to attribute transcripts to 
a specific CRISPR array and, therefore, bac- 
terial strain. In metatranscriptomics, RNA is 
extracted from fecal samples 
and represents the wide array 
of species colonizing the gut. 
Attributing mRNA molecules 
to genomes depends on the 
specificity of the short mRNA 
sequence to a single genome. 
Schmidt et al. exemplify the 
importance of this limitation. 
They barcoded two CRISPR 
arrays and transformed the 
recording plasmids into E. 
coli cells of two strains. One 
was a wild-type strain, and 
the other had one gene deleted.
They cocolonized the two strains in mice and studied
the effect of the deletion on gene expression of that strain compared
with the wild type, showing the dynamics
between the two similar strains in vivo.
This analysis could not have been done with
metatranscriptomics.

13 MAY 2022 • VOL 376 ISSUE 6594 697

INSIGHTS | PERSPECTIVES
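
The strain attribution described in this experiment can be pictured as a simple demultiplexing step. The following is a minimal sketch under stated assumptions: the four-letter barcodes, the strain labels, and the read layout (barcode followed directly by spacer) are hypothetical and are not the actual Record-seq read structure.

```python
from collections import defaultdict

# Hypothetical barcodes; the "barcode + spacer" layout is an assumption
# made for illustration only.
BARCODES = {"ACGT": "wild_type", "TGCA": "deletion_mutant"}


def assign_reads(reads, barcodes=BARCODES, barcode_len=4):
    """Group spacer sequences by the strain whose barcode prefixes the read."""
    by_strain = defaultdict(list)
    for read in reads:
        strain = barcodes.get(read[:barcode_len])
        if strain is None:
            continue  # unknown barcode: leave the read unassigned
        by_strain[strain].append(read[barcode_len:])
    return dict(by_strain)


# The spacer GGGTTT appears in both strains; only the barcode tells them apart.
reads = ["ACGTGGGTTT", "TGCAGGGTTT", "ACGTCCCAAA", "NNNNGGGTTT"]
assigned = assign_reads(reads)
```

The identical spacer is assigned to two different strains here, which is the situation in which a transcript sequence alone could not be mapped to a single genome.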

The archive quality of Record-seq could 
be leveraged for clinical applications. In dia- 
betes mellitus, following the patient’s blood 
glucose concentration is important for as- 
sessing disease progression and the effec- 
tiveness of the treatment. Although blood 
glucose concentration indicates whether the 
patient is in hyperglycemia at the moment 
of sampling, there is a measure, the percentage
of glycated hemoglobin (%HbA1c), that
reflects the glycemic state of the patient over
the past 3 months. This test is invaluable 
for the diagnosis and management of diabe- 
tes, exemplifying the importance of a single 
test that captures longitudinal information. 
Other chronic diseases whose management 
depends on lifestyle adjustments can benefit 
from such a test. Schmidt et al. show that 
Record-seq-engineered sentinel cells can 
identify the existence and severity of intesti- 
nal inflammation in a colitis mouse model. 
Celiac patients, for example, could benefit 
from such a system of bacterial sentinel cells 
that monitors their adherence to dietary 
restrictions and the state of their disease. 
Furthermore, if successfully adapted to be 
used in human microbiota, this system can 
be used to design an assay for diet effects on 
inflammatory bowel disease (IBD) to develop 
a personalized nutritional plan to minimize 
intestinal inflammation in patients (9). 

The method that Schmidt et al. intro- 
duce opens new avenues for studying the 
microbiota response to environmental fac- 
tors and paves the way toward the devel- 
opment of more effective microbiota-based 
treatments. Two of the major advantages 
of their method—its ability to “document” 
responses in changing states and to associ- 
ate outputs with specific strains within a 
complex community—are very much miss- 
ing from other microbiota -omics research 
methods. Proteomics- and metabolomics- 
based research methods, which cover other 
important layers in microbiota dynamics 
and its interactions with the host, would 
benefit from similar improvements. 
Beating natural proteins at filtering water


Artificial fluorous channels outperform 
aquaporins in water permeation 


By Yuexiao Shen 


Humans have long drawn inspiration
from nature to create tools that can
mimic a specific function. In recent
decades, researchers have taken this
philosophy to the nanoscopic level,
such as in the creation of synthetic
structures for water purification inspired by 
biological systems. Aquaporins are a type of 
naturally occurring protein known as water 
channels, which have notable water perme- 
ability and selectivity (1). On page 738 of 
this issue, Itoh et al. (2) report the design 
of fluorous channels—structures made out 
of fluorocarbons—that can self-assemble 
from oligoamide nanorings and outperform 
biological water channels in permeability. 
Because of the superhydrophobic interior 
surface endowed by densely covered fluo- 
rine atoms, these nanochannels exhibit a 
water permeation flux that is two orders of 
magnitude greater than that of aquaporins. 


Nature evolves ideal water-selective chan- 
nels, including aquaporins, that are embed- 
ded in cell membranes to regulate osmotic 
balance and make critical biophysical con- 
tributions to living organisms. Aquaporins 
form an hourglass-shaped narrow channel 
with a ~0.3-nm aperture and a hydropho- 
bic interior surface, resulting in notable 
water permeability and solute rejection (1).
However, these energy-efficient structures 
have been difficult to mimic. The first arti- 
ficial water channels could accommodate 
water molecules, but the mobility was slow 
and there was little rejection of other small 
molecules (3, 4). Later synthetic structures 
maintained high water conductance (5-8) 
while also rejecting salts, including NaCl 
(6-9), and even protons (7, 8). 

A frictionless water channel
Itoh et al. designed a nanochannel with a superhydrophobic interior
surface that beats the naturally occurring protein aquaporin at its
own game. The stacked nanorings form a nanochannel where water
clusters are broken down into a single file as they flow through.
The ring is fabricated by condensing fluorine-containing amide
monomers. The electronegative fluorine (F) atoms interact with
the adjacent hydrogen atoms, which stabilizes the ring structure.

When water flow is confined to a subnanometer
channel with hydrophobic
interior surfaces, classical hydrodynamics
is no longer applicable. The clustered
arrangement of water molecules breaks
down and leads to the formation of single-file
water “wires” within channels. This
single-file arrangement, in combination 
with an increase of frictionless “slip flow” 
at the channel walls, is believed to be the 
key factor that enables aquaporins and 
their mimics to achieve extremely fast wa- 
ter flow (10). Itoh et al. created a new class 
of synthetic channels that was inspired 
by the properties of Teflon, a well-known 
superhydrophobic fluorous material. They 
developed a series of fluorine-based mac- 
rocyclic channels whose water perme- 
ability exceeds that of aquaporins by more 
than 100-fold. 

To create confined fluorinated surface 
chemistry at the nanoscale, Itoh et al. 
oligomerized fluorine-containing amide 
monomers through condensation reac- 
tions (see the figure). They report that 
because of their high 
electronegativity, fluorine 
atoms electrostatically in- 
teract with the hydrogen 
atoms from the adjacent 
polar amide groups and 
form hydrogen bonds. This 
makes the carbon-fluorine 
bonds point inward and 
the entire macrocyclic ring 
rigid, resulting in a series 
of fluorous oligoamide na- 
norings with pore diam- 
eters ranging from 0.9 to 
1.95 nm. After these rings 
are stacked on top of each 
other within a lipid membrane, the nano- 
channel’s interior surface is densely coated 
with fluorine atoms. 

The superhydrophobic fluorinated sur- 
face enhances the breakdown of water 
clusters, as demonstrated by molecular 
dynamics simulations, thus substantially 
improving the water transport rate. Not 
surprisingly, these fluorous channels were 
experimentally shown to exhibit two to 
three orders of magnitude greater water 
permeability (5.5 × 10⁻¹⁰ cm³ per second
per channel) than both aquaporins and 
recent aquaporin mimics, including syn- 
thetic channels and carbon nanotubes. In 
addition to exceptional water permeability, 
these fluorous channels also offer excel- 
lent rejection of anions such as chloride 
because of the electrostatically negative 
surface. This means that potential future 
applications such as desalination and water
purification are possible. A key feature
of aquaporins, proton exclusion combined
with high water transport, was not demonstrated
by Itoh et al., but this is less relevant
to water-purification applications.

It is exciting to see that by exercising 
exquisite control of synthetic chemistry, 
it is possible to engineer synthetic chan- 
nels at the nanoscale to achieve superfast 
water transport and go beyond the per- 
formance of aquaporins. However, before 
this technology can be used in real-world 
applications, there are a few challenges 
to overcome. Even if these newly devel- 
oped channels have exceptional transport 
properties, the low synthesis yield (<0.2%) 
could be a hurdle for scalable applications. 
There is also the practical challenge of 
incorporating the channels into filtration 
materials, that is, whether they can be 
used as additives in the traditional mem- 
brane fabrication process to enhance the 
performance (11) or whether their use
requires a complete rethink
of the manufacturing pro- 
cess. Researchers need to 
not only reengineer ma- 
terials at the microscopic 
scale to improve macro- 
scopic performance but 
also find solutions by com- 
bining material innovation 
with scalable approaches 
(12). If the goal is to make 
membrane separation ma- 
terials, future research will 
need to focus on imple- 
menting the concept of 
fluorous channels in the 
structure of scalable polymer membranes, 
for example, to figure out how to promote 
water transport within the polymer matrix 
and enhance membrane toughness such as 
chlorine resistance. Also, the field has con- 
verged on the tenet that selectivity is still 
the foremost aspect when considering syn- 
thetic channel design because critical sep- 
arations in the subnanoscale range are en- 
abled by increasing selectivity rather than 
permeability (13). The fluorous channels 
reported by Itoh et al. bring more diverse 
chemistry and possibilities for the design 
of synthetic channels and separation mate- 
rials in the future. 
QUANTUM PROCESSES


Anomalous fluid flow in quantum systems


New regimes of hydrodynamics are probed with synthetic quantum matter


By Alan Morningstar and Waseem Bakr 


Recent progress in cooling, controlling,
and measuring a small number
of isolated atoms or ions has led to
the creation of experimental platforms
with which quantum phenomena
can be realized and explored
in fine detail. In quantum simulation ex- 
periments, idealized theoretical models are 
implemented on these platforms to study 
them experimentally in otherwise inacces- 
sible regimes. For example, a set of many in- 
teracting quantum “spins” can be simulated 
in the experiment, and the collective fluid- 
like dynamics they exhibit can be measured. 
The theoretical framework describing this 
fluid flow, known as hydrodynamics, is said 
to “emerge” from the basic laws that gov- 
ern the system. How exactly this happens, 
and what emergent dynamics are possible, 
are questions at the frontier of physics. On 
pages 720 and 716 of this issue, Joshi et al. 
(1) and Wei et al. (2), respectively, report on
observations of unusual types of hydrody- 
namics in quantum simulations. 

Despite a difference in the underlying 
experimental platforms, both experiments 
are simulations of one-dimensional (1D) 
systems of quantum spins, each with two 
states: up and down. Pairs of spins can 
exchange their states by interacting but 
cannot independently flip. As a result, the 
difference in the total number of up spins 
and down spins—the total magnetization— 
does not change under the dynamics. 

However, as a collective effect, magne- 
tization does flow within these systems. 
The most common such hydrodynamics is 
called diffusion, which is the type of flow 
that results from spin moving
through the system with random 
short-range hops. In diffusion, 
a local excess of magnetization 
decays slowly with time accord- 
ing to a dynamical exponent, 
commonly denoted as the letter 
z, with a value of 2. By contrast,
a system in which spin streams
through uninterrupted is called
“ballistic” and hosts a faster decay,
with a z value of 1. Joshi et al.
and Wei et al. present different
experiments in which anomalous
hydrodynamics and intermediate
“superdiffusion” are observed,
corresponding to values of z between
1 and 2 (see the figure).
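
One way to read a dynamical exponent off data is to fit how fast an initially local excess spreads. The sketch below assumes the standard scaling in which the width of the excess grows as t^(1/z); the function names, the hop probabilities, and the chosen times are illustrative and are not taken from either experiment.

```python
import math


def fit_dynamical_exponent(times, widths):
    """Fit width ~ t**(1/z) by least squares on a log-log scale; return z."""
    xs = [math.log(t) for t in times]
    ys = [math.log(w) for w in widths]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    slope = (sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
             / sum((x - mean_x) ** 2 for x in xs))
    return 1.0 / slope


def diffusive_widths(times):
    """Exact spreading of a point excess under random nearest-neighbor hops."""
    t_max = max(times)
    center = t_max
    p = [0.0] * (2 * t_max + 1)   # wide enough that nothing reaches the edge
    p[center] = 1.0
    widths = {}
    for t in range(1, t_max + 1):
        q = [0.0] * len(p)
        for i, weight in enumerate(p):
            if weight:
                q[i] += 0.5 * weight       # stay put
                q[i - 1] += 0.25 * weight  # hop left
                q[i + 1] += 0.25 * weight  # hop right
        p = q
        if t in times:
            variance = sum(w * (i - center) ** 2 for i, w in enumerate(p))
            widths[t] = math.sqrt(variance)
    return [widths[t] for t in times]


def ballistic_widths(times):
    """Excess that streams left or right at unit speed: width equals t."""
    return [float(t) for t in times]


times = [4, 8, 16, 32, 64]
z_diffusive = fit_dynamical_exponent(times, diffusive_widths(times))
z_ballistic = fit_dynamical_exponent(times, ballistic_widths(times))
```

The random-hop model returns a fitted exponent of 2 and the streaming model returns 1; a superdiffusive system would fall between the two.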

Joshi et al. use a chain of 
trapped ions to simulate a 1D spin 
system with long-range spin exchange
interactions. The parameter
α, which characterizes the
decay of the strength of the long- 
range interactions as a function of 
distance, is tunable in their exper- 
iment. Because of these interac- 
tions, one could expect the hydro- 
dynamics of magnetization to be 
the same as in a system in which 
spin randomly hops around, simi- 
lar to diffusion, but with a distri- 
bution of hop sizes that matches 
the long-range interactions. This 
behavior is known as a “Lévy 
flight” (3). In Lévy flights, when 
large hops are common enough, 
they become relevant and cause 
the flow of magnetization to be- 
come anomalous, with a z value 
that is <2 and changes as the parameter
α is tuned (4).
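
The role of rare large hops can be illustrated numerically. In this sketch, short-range hops are Gaussian and heavy-tailed hops are drawn from a Pareto distribution with tail exponent 1.5; both choices, the seed, and the sample size are arbitrary assumptions for illustration and are not the hop distribution of the trapped-ion experiment.

```python
import random


def gaussian_steps(n, rng):
    """Short-range hops: step sizes with a finite variance."""
    return [rng.gauss(0.0, 1.0) for _ in range(n)]


def heavy_tailed_steps(n, rng, tail_exponent=1.5):
    """Levy-flight-like hops: Pareto-distributed sizes with a random sign."""
    return [rng.choice((-1, 1)) * rng.paretovariate(tail_exponent)
            for _ in range(n)]


def largest_step_share(steps):
    """Fraction of the total distance covered that comes from the biggest hop."""
    sizes = [abs(s) for s in steps]
    return max(sizes) / sum(sizes)


rng = random.Random(0)   # fixed seed so the run is repeatable
normal = gaussian_steps(10_000, rng)
levy = heavy_tailed_steps(10_000, rng)

share_normal = largest_step_share(normal)
share_levy = largest_step_share(levy)
```

With heavy tails, a single hop accounts for a much larger share of the total distance covered, which is why such hops can change the character of the flow.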

To test the Lévy flight theory, 
Joshi et al. prepare a set of initial 
states with all spins in the chain 
set randomly up or down, except for the 
spin at the center of the chain, which is ini- 
tially set to up. Using this setup, they mea- 
sure how the central excess magnetization 
spreads through the chain over time. This 
is done for several values of the interaction
parameter α, and the dependence of
the measured dynamical exponent z on α
is consistent with the Lévy flight picture.

Superdiffusion in quantum systems
How does anomalous fluid flow arise in one-dimensional quantum systems?
(Axis labels: spin density, time.)

Trapped ion quantum simulator
Joshi et al. simulated the hydrodynamics of a spin chain with long-range
interactions using a string of ions. The center spin was set to up. The
rest of the spins were set to random. The evolution of the system is
described by a "Lévy flight," which is a random walk with steps drawn
from a distribution with heavy tails.

Neutral atom quantum simulator
Wei et al. studied superdiffusion in a Heisenberg chain by measuring the
relaxation dynamics of a domain wall in the spin chain. Half the spins
were set to up. The other half were set to down. The evolution of the
system exhibits connections to the Kardar-Parisi-Zhang equation, which
is used to understand stochastic surface growth.

In a separate work, Wei et al. use ultracold
atoms to simulate a so-called
“Heisenberg” spin chain with short-range
interactions. The initial states they prepared
were “domain walls,” in which one
half of the chain is spin up and the other
half is spin down. As spin is exchanged between
the two halves and the system approaches
equilibrium, the authors measure
the net magnetization transferred across
the initial boundary, which allows them to
characterize the flow of magnetization in
the system.

The Heisenberg spin chain has a certain
rotational symmetry and a special
property known as integrability. These
two ingredients are predicted to result in
a specific type of superdiffusion with the
dynamical exponent z of 3/2. Wei et al.
demonstrate this superdiffusion in their
system and probe it further by destroying
it in two distinct ways. First, they tune the
total magnetization away from zero, breaking
the rotational symmetry, which causes
the system to become ballistic: z goes to 1.
Then, they leave the symmetry intact but
break the integrability, which leads to diffusion:
z goes to 2. In this way, the authors
provide experimental evidence that it is
the combination of integrability
and symmetry that gives rise to 
superdiffusion in the Heisenberg 
spin chain (5). Exploration of 
these dynamics is motivated by an 
unexpected connection to a class 
of systems called the Kardar- 
Parisi-Zhang (KPZ) universality 
class, which describes stochastic 
surface growth dynamics but has 
no obvious relation to quantum 
dynamics (6-11). 

Both Joshi et al. and Wei et al. 
show that quantum simulation ex- 
periments are a valuable approach 
to exploring the dynamics of quan- 
tum systems. Each set of authors 
observe and characterize an anom- 
alous emergent hydrodynamics 
and are able to tune its properties 
by varying the underlying experi- 
mental parameters. Although both 
experiments result in superdiffu- 
sion, the underlying mechanisms 
are shown to be very different. In 
the case of Joshi et al., the mecha- 
nism seems to be well understood, 
and the experiment is a crisp re- 
alization of emergent Lévy flight- 
like dynamics in a quantum sys- 
tem, whereas in Wei et al., there 
is still some mystery surrounding 
the connection to KPZ. 

Although hydrodynamics is 
a nonquantum framework, the 
underlying properties that allow 
these experiments to be done 
are quantum mechanical. It will 
be interesting to see what other 
types of emergent dynamics will 
arise under the “quantum condi- 
tions” found in modern quantum 
simulation experiments. As these 
simulators continue to push past 
the capabilities of first-principles 
theoretical methods, they are sure to move 
the study of quantum dynamics into an
exciting data-rich regime.
CLIMATE POLICY


Investor-state disputes threaten 
the global green energy transition 


Global action on climate change could generate upward 
of $340 billion in legal claims from oil and gas investors 


By Kyla Tienhaara, Rachel Thrasher,
B. Alexander Simmons, Kevin P. Gallagher


To limit global warming to below 1.5°C,
governments will have to simultaneously
curb demand for fossil fuels
and limit supply (2). However, efforts
to limit supply will affect asset holders,
particularly in the upstream
(exploration and production) and midstream
(transportation and storage) portions of the 
supply chain. Demands for compensation 
will ensue, and when the companies involved 
are foreign, legal claims may be brought to 
international arbitration in a process known 
as investor-state dispute settlement (ISDS) 
(see the first figure). The Intergovernmental 
Panel on Climate Change (IPCC) has recently 
acknowledged that ISDS cases could lead to 
states refraining from, or delaying, measures 
to phase out fossil fuels (2). Here, we illus- 
trate legal and financial risks associated with 
limiting oil and gas production and argue 
that governments should take steps to pre- 
vent fossil fuel investors from accessing ISDS. 
The International Energy Agency (IEA) 
Net-Zero Emissions Scenario (NZE) by 2050 
has no new investments in fossil fuel sup- 
ply after 2021 (3). In a 1.5°C carbon budget, 
nearly 60% of oil and fossil methane gas
reserves and 90% of coal reserves cannot 
be extracted (4), suggesting that even some 
existing operations are unviable. Although 
many government plans for fossil fuel pro- 
duction are inconsistent with their commit- 
ments under the Paris Agreement (5), the 
launch of the Beyond Oil and Gas Alliance 
(BOGA) in 2021, by 11 national and subna- 
tional governments pledging Paris-aligned 
dates for ending oil and gas production, sug- 
gests that the tide may be turning. At the 
same time, a UK firm, Rockhopper, is using 
ISDS to seek an undisclosed amount over 
Italy’s denial of a coastal drilling permit, and 
a Canadian firm, TC Energy, is claiming more 
than $15 billion in compensation for US Pres- 
ident Biden’s cancellation of the Keystone XL 
Pipeline (table S1). 


PROTECTIONS FOR FOREIGN INVESTORS 

Thousands of bilateral and plurilateral in- 
vestment treaties have been signed by states 
over the last 50 years (6). Investment treaties 
act as a form of political risk insurance and 
were presented to governments in the Global 
South as a means to increase foreign invest- 
ment flows. Under these treaties, foreign 
investors are protected from certain forms 
of state conduct but have no responsibili- 
ties themselves. Investors are empowered to 
bring claims that a state has breached a treaty 
directly to an ad hoc ISDS tribunal composed
of three members: one chosen by the investor,
one by the state, and one that is mutually
agreed upon or appointed by an arbitration 
body such as the International Centre for the 
Settlement of Investment Disputes (ICSID). 
Foreign investors have relied on various 
investment treaty rules to claim that policies 
to protect the environment, public health, or 
the economy are discriminatory, undermine 
their “legitimate expectations,” or impose a
burden that is not justified or proportional 
to the state’s policy objectives (7) (table S2). 
If it is determined that a state has breached
an investment treaty, the tribunal will typi- 
cally require the government to pay the in- 
vestor compensation. Increasingly, there is 
a tendency for tribunals to use projections 
of an investment’s expected future income 
across its entire life cycle as the basis of com- 
pensation, using discounted cash flow as the 
method of calculation (8). The result has been 
hugely inflated monetary awards in the hun- 
dreds of millions and even billions of dollars. 
In some cases, the amount of compensation 
awarded has been described as “crippling” 
because it is incommensurate to the state’s 
capacity to pay (9). Governments don’t al- 
ways lose, but the rules are sufficiently vague 
to make it difficult to predict outcomes. This, 
combined with the risk of a very large award, 
can create a chilling effect. Critically, despite 
a growing body of research (10), there is a
lack of understanding on the part of many 
states that the treaties do not live up to their 
purported benefit of facilitating investment. 
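
To see why valuing a project by its projected lifetime income can yield awards far larger than the money actually spent, consider a minimal discounted cash flow sketch. All figures are hypothetical (a project earning $100 million per year for 20 years, a 10% discount rate, and $150 million of sunk costs) and do not come from any ISDS case; the function name is likewise illustrative.

```python
def discounted_cash_flow(cash_flows, discount_rate):
    """Present value of yearly cash flows, the first arriving one year from now."""
    return sum(cf / (1.0 + discount_rate) ** year
               for year, cf in enumerate(cash_flows, start=1))


# Hypothetical project: $100 million per year for 20 years, 10% discount rate.
projected_income = [100.0] * 20   # millions of dollars per year
claim_value = discounted_cash_flow(projected_income, 0.10)

# Hypothetical amount actually invested before the project was cancelled.
sunk_costs = 150.0                # millions of dollars
ratio = claim_value / sunk_costs
```

Under these assumptions, the discounted value of expected income is roughly $851 million, several times the amount invested.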


ISDS AND THE FOSSIL FUEL SECTOR 

Law firms are advising investors that are 
“most affected by States’ climate change ob- 
ligations” (i.e., those in the fossil fuel sector) 
to (re)structure their investments to ensure 
access to ISDS (11). To date, there have been
at least 231 ISDS cases related to fossil fuel 
investments, which is almost 20% of the total 
known number of cases across all sectors (12).
The vast majority (92%) of fossil fuel cases 
are related to oil and gas investments (12). 
Fossil fuel investors have been successful in 


The stages of an investor-state dispute settlement 
Foreign investors may have a legal claim for compensation if countries cancel treaty-protected fossil fuel projects where investors have already obtained at least 

an exploration permit. If the lawsuit is successful, countries face substantial financial losses and may pull out of climate commitments to avoid additional ISDS claims 
from other investors. Colors of each stage correspond to the primary actor(s) involved: the investor (red), state (blue), tribunal (gray), or multiple actors (purple). 


Action from 

the state 

The state decides 
to phase out future 
or existing fossil 
fuel—based projects 
to align with their 
new climate 
commitments. 


Investment treaties 
and concessions 
States sign investment 
treaties to encourage 
foreign investment and 
allocate concessions 
to fossil fuel investors 
years in advance of the 
investment's operation. 
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Notice of 
arbitration 

Foreign investor of 

an affected project 
notifies the state 

of a potential breach 
of its commitments 
under an international 
investment agreement. 


Selection of 
arbitrators 

Three lawyers are 
appointed by the 
state and foreign 
investor as arbitrators 
in the investor-state 
dispute settlement 


(ISDS). future profits. 


Tribunal proceedings 
Arbitrators examine if 
the state undermined 
the investor's “legitimate 
expectations” or the 
value of their investment 
and estimate losses, 
stranded assets, or 


Judgment 
rendered 
Arbitrators determine 
if the state violated 
its previous 
commitments and 
owes compensation 
to the injured 

foreign investor. 


Troubling 
feedbacks 

Expensive awards or 
settlements could lead 
to states pulling out of 
climate commitments 
and rolling back 
regulations out of fear 
of more ISDS cases. 
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72% of cases that were decided on 
the merits where the final award 
was disclosed (12). Fossil fuel cases 
have a high average amount of 
compensation awarded ($600 million)
(12), and asset valuation is
complicated by the volatility of oil 
prices, which are highly sensitive 
to exogenous shocks such as the 
pandemic and the war in Ukraine. 
Although only a small number 
of ISDS cases to date concern cli- 
mate actions (table S1), this will 
likely change if governments be- 
gin to adopt more stringent poli- 
cies, particularly if those policies 
directly affect fossil fuel investors 
(table S3). States can argue that 
investors should have anticipated 
that they would be affected by cli- 
mate policies, given long-standing 
scientific consensus that fossil 
fuels are the primary source of 
greenhouse gas emissions and in- 
ternational agreements on climate 
change mitigation dating back 30 
years. Nevertheless, it remains 
unclear how tribunals will weigh 
generalized commitments by 
states to address climate change 
against specific commitments to 
further oil and gas production 
(e.g., through issuance of leases). 
Countries with a large share of 
fossil fuel assets under the protec- 
tion of foreign investment treaties 


Treaty-protected fossil fuel projects could delay climate action 

(Top) Extent of the world’s oil and gas projects without a final investment decision that are protected by foreign 
investment treaties, and their estimated production volume in millions of barrels per day (MMbbl/day). (Bottom) 
Estimated total net present value (NPV) of all treaty-protected oil and gas projects without a final investment decision 
across four oil price scenarios, including the average total NPV per country across all scenarios, and the share of the 
the NZE and cancel all fossil fuel
projects that do not have a final 
investment decision as of 31 De- 
cember 2021 (3), foreign investors that have 
already obtained at least an exploration per- 
mit by this date would have grounds for an 
ISDS claim (as in Rockhopper v. Italy and 
Lone Pine v. Canada; table S1). States could 
take more aggressive action and additionally 
cancel projects that have a final investment 
decision but are not yet producing (i.e., under 
development), which could give rise to more 
ISDS claims. 

To analyze risks in these scenarios, consid- 
ering both the existing caseload of fossil fuel 
industry ISDS claims and the availability of 
comprehensive project-level data, we chose 
to focus exclusively on upstream oil and gas. 
We utilized Rystad Energy’s UCube database, 
which contains a list of countries’ crude 
oil, gas, condensate, and natural gas liquid 
(NGL) assets and their attributes, including 
the production status, real or anticipated ap- 
proval year, country headquarters of foreign 
investors, production volume per investor,
and estimates of the net present value (NPV) 
per investor. We merged this dataset with in- 
formation on 1828 investment treaties that 
provide access to ISDS (6). To illustrate the 
potential range of financial compensation for 
treaty-protected projects under uncertainty 
of future oil and gas prices, we used four NPV 
estimates: one using Rystad Energy’s tempo- 
rally dynamic NPV prediction, and three as- 
suming a constant price of industry-standard 
Brent oil (and gas equivalent) of $50, $75, 
and $100 per barrel (see supplementary ma- 
terials for detailed methods). 
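
As a rough illustration of the matching step described above, the sketch below flags projects whose host country and investor headquarters are linked by a treaty, then totals their NPV under each price scenario. It is not the authors' pipeline: the `Project` class, the country codes, the treaty pairs, and every NPV figure are invented for the example, and the real analysis relies on Rystad Energy's UCube data and 1828 treaties.

```python
from dataclasses import dataclass


@dataclass
class Project:
    name: str
    host: str          # country where the asset is located
    investor_hq: str   # headquarters country of the investor
    npv: dict          # scenario label -> NPV in billions of US dollars (made-up)


def is_protected(project, treaties):
    """True if a foreign investor is covered by a treaty between the two countries."""
    if project.host == project.investor_hq:
        return False  # simplification: treaty protection applies to foreign investors only
    return frozenset((project.host, project.investor_hq)) in treaties


def exposure_by_scenario(projects, treaties):
    """Sum the NPV of treaty-protected projects under each price scenario."""
    totals = {}
    for p in projects:
        if is_protected(p, treaties):
            for scenario, value in p.npv.items():
                totals[scenario] = totals.get(scenario, 0.0) + value
    return totals


# Hypothetical inputs: country codes, treaty pairs, and NPVs are all invented.
TREATIES = {frozenset(("A", "B")), frozenset(("B", "C"))}
PROJECTS = [
    Project("p1", "A", "B", {"dynamic": 2.0, "$50": 1.0, "$75": 2.5, "$100": 4.0}),
    Project("p2", "A", "A", {"dynamic": 9.0, "$50": 9.0, "$75": 9.0, "$100": 9.0}),
    Project("p3", "C", "D", {"dynamic": 5.0, "$50": 5.0, "$75": 5.0, "$100": 5.0}),
    Project("p4", "B", "C", {"dynamic": 1.0, "$50": 0.5, "$75": 1.5, "$100": 3.0}),
]

totals = exposure_by_scenario(PROJECTS, TREATIES)
low, high = min(totals.values()), max(totals.values())
print(f"Protected NPV range: ${low} billion to ${high} billion")
```

Reporting the minimum and maximum across the four scenarios mirrors how the text presents each valuation as a range rather than a single figure.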


BILLIONS IN COMPENSATION 

UCube lists 55,206 upstream oil and gas 
projects in 159 countries that are predicted 
to have a final investment decision made 
during 2022-2050 but for which at least 
an exploration permit had already been ob-
tained by investors. If countries cancel these
projects in line with the NZE, ISDS claims 
are possible for at least 10,506 (19%) of these 
projects across 97 countries, which are cur- 
rently protected by 334 different invest- 
ment treaties. This represents 18 to 21% of 
all crude oil, gas, condensate, and NGL as- 
sets and 8 to 11% of the total estimated pro- 
duction of all oil and gas projects awaiting 
a final investment decision (see the second 
figure). The total NPV of these protected 
projects is expected to reach $60 billion to 
$234 billion, depending on future oil prices. 
Valuations for crude oil production are the 
greatest ($20-114 billion), followed by gas 
($20-90 billion), condensate ($7-21 billion), 
and NGL ($2-8 billion) (see the second fig- 
ure). This translates to tremendous potential 
financial losses for countries across income 
levels if ISDS claims are successful. The five 
countries with the greatest potential losses 
from ISDS are Mozambique ($7-31 billion),
Guyana ($5-21 billion), Venezuela ($3-21 bil- 
lion), Russia ($2-16 billion), and the United 
Kingdom ($3-14 billion). 

If countries decide to also cancel oil and 
gas projects that are currently under devel- 
opment, this could introduce substantial ad- 
ditional financial losses from ISDS claims. Of 
the 1941 oil and gas projects under develop- 
ment in the last decade across 62 countries, 
225 are protected under 81 foreign invest- 
ment treaties. Although this represents a 
smaller share of all oil and gas projects (12%) 
and production volume (9%) under devel- 
opment, potential valuations remain high. 
The total estimated NPV ranges from an ad- 
ditional $32 billion to $106 billion, with gas 
($12-52 billion) and crude oil ($14-42 bil- 
lion) constituting the largest share. In this 
scenario, 33 countries would be vulnerable to 
additional ISDS claims, most notably Russia 
($5-23 billion), Mozambique ($5-19 billion), 
Kazakhstan ($6-18 billion), Guyana ($2-6 
billion), and Indonesia ($3-4 billion). 

The Energy Charter Treaty (ECT) is the 
greatest contributor to these potential ISDS 
claims. The ECT, the only investment treaty 
with an exclusive focus on energy, has been 
ratified by 50 countries, mostly in Europe 
(table S4). The ECT applies to 19% of all 
treaty-protected oil and gas projects without 
a final investment decision, and 13% of those 
projects are only covered by the ECT. This 
represents 7% of the production volume of all 
treaty-protected projects and 6 to 7% ($3-16 
billion) of the total NPV of those projects. For 
oil and gas projects that are under develop- 
ment, 6% of all treaty-protected projects are 
only covered by the ECT, which constitutes 
4 to 6% ($2-4 billion) of the total valuation. 


AN ABOLITIONIST APPROACH 

International treaties that provide protec- 
tion to investors and access to ISDS in- 
crease the power of the fossil fuel industry 
to resist the implementation of policy mea- 
sures that are crucial for the global green 
energy transition. When investors launch 
claims for compensation under ISDS, or 
threaten to do so, this can result in the de- 
lay of policy action, extension of deadlines 
for moratoria, or exemptions and payment 
of far greater compensation to leasehold- 
ers than would be available under domes- 
tic remedies. For example, fears of ISDS 
prevented New Zealand from becoming a 
full member of BOGA and committing to a 
Paris-aligned phase-out of oil and gas (13). 
Ultimately, countries must decide if the fi- 
nancial risks of immediate climate action 
outweigh the societal risks from delayed cli- 
mate action. At the very least, governments 
should immediately halt oil and gas explo- 
ration. This would directly align with global 
climate targets, and set an immediate cap
on the extent of ISDS compensation claims 
countries might face in the future. 

More direct actions will be necessary to re- 
duce existing ISDS threats without limiting 
national climate ambitions. The Organisa- 
tion for Economic Co-operation and Devel- 
opment is conducting work on how this can 
be achieved (14). Governments should take
measures to prevent existing leaseholders 
from accessing ISDS, such as terminating 
all bilateral investment treaties (10). Several 
countries, including South Africa, have done 
so, without any resulting reductions in for- 
eign investment. States can also negotiate the 
removal of ISDS clauses from trade agree- 
ments, as the United States did with Canada 
in the US-Mexico-Canada Agreement. An- 
other option is for states to withdraw consent 
to ISDS in cases involving fossil fuel invest- 
ments, emulating the approach taken by 
Singapore and others to remove the threat of 
ISDS claims from the tobacco industry. 

Our proposals are more efficient and effec- 
tive than trying to fix the investment treaty 
system through procedural reforms or the 
introduction of exceptions clauses for cli- 
mate policy. Reformist approaches would be 
time-consuming and likely ineffectual, based 
on the experience of previous efforts. Never- 
theless, there are two major obstacles to our 
abolitionist approach. The first is “sunset 
clauses” that allow for continuation of trea- 
ties for 10 to 20 years for investments com- 
menced prior to termination. Sunset clauses 
can be nullified through mutual agreement 
of the parties. The second obstacle—resis- 
tance to treaty termination or modification 
from states with powerful fossil fuel lobbies— 
could prove more difficult to overcome. 

These issues are playing out in discus- 
sions around the “modernization” of the 
ECT. Given that the ECT protects far more 
oil and gas production than any other 
treaty, it should be prioritized for termina- 
tion. This could reduce the global price tag 
of oil and gas project cancellations by $5 
billion to $20 billion, or more. Yet although 
several European members are consid- 
ering termination or coordinated with- 
drawal, 32 additional countries have begun 
the process to accede to the treaty (table 
S4). Expanding the ECT to these countries 
would increase the number and produc- 
tion volume of treaty-protected oil and gas 
projects without a final investment deci- 
sion by 6% and 25%, respectively, and the 
total NPV of these new protected projects 
could increase the global ISDS price tag by 
19 to 24% ($12-45 billion). Terminating the 
ECT would thus substantially reduce exist- 
ing financial risks and eliminate additional 
future risks from expansion. 

Although wealthy nations, such as the
United Kingdom, Denmark, and Ireland,
would benefit most from the avoided finan- 
cial losses that would come with ECT termi- 
nation, the threat of ISDS claims through 
other treaties for low- and middle-income 
countries would remain a major concern, 
particularly for countries vulnerable to se- 
vere climate change impacts and struggling 
to escape the debt crisis. The high end of our 
liability estimate ($340 billion) is more than 
the total level of public climate finance glob-
ally in 2020 ($321 billion) (15). Potential liti-
gation risks are even greater if coal mining
and midstream fossil fuel infrastructure are 
considered. We cannot afford to divert such 
a substantial amount of public finance from 
essential mitigation and adaptation efforts to 
compensate investors in fossil fuels. 

Rethinking genetic disease


Carriers of the fragile X premutation can experience 
medical conditions of their own 


By Bill Sullivan 


It is widely assumed that being a carrier
of a genetic disease means that only 
one’s children are at risk of develop- 
ing a medical condition. But there is at 
least one striking exception to this rule. 
The genetic aberration causing fragile 
X syndrome, which is marked by severe 
cognitive and behavioral problems, includ- 
ing autism, accumulates over time across 
generations. Seemingly unrelated 
medical issues have plagued par- 
ents and grandparents of indi- 
viduals with fragile X syndrome. 
In The Carriers, psychiatrist Anne 
Skomorowsky tells their story and 
how doctors pieced together this 
complex hereditary puzzle. 
Through absorbing case reports, 
Skomorowsky lifts the curtain on 
the bewildering array of maladies 
that fragile X carriers may endure. 
She was moved to learn more after 
seeing Jessie, a 22-year-old patient who was 
struggling with premature menopause and 
worried about her ailing father, who was de- 
veloping tremors and having trouble walking. 
Surprisingly, a genetic test revealed that the 


Columbia 


The reviewer is the author of Pleased to Meet Me: Genes, 
Germs, and the Curious Forces That Make Us Who We Are 
(National Geographic Books, 2019). Email: wjsulliv@iu.edu 


704 13 MAY 2022 • VOL 376 ISSUE 6594


The Carriers 
Anne Skomorowsky 
pair’s disparate medical problems were caused
by a mishap at the same gene: fragile X mental
retardation 1 (FMR1).

The FMR1 gene makes a protein (FMRP)
necessary for the proper development and
function of neurons in the brain. In most
patients with fragile X syndrome, the FMR1
gene has been shut down, which means
FMRP is no longer made.

Like all genes, FMR1 is preceded by a
stretch of DNA bases that regulate its ac-
tivity. The DNA sequence upstream
of FMR1 contains repeats of three
nucleotides (cytosine and two
guanosines, or CGG). Most people
have 30 to 40 CGG repeats preced-
ing FMR1. Those harboring 55 to
200 CGG repeats are considered
premutation carriers and are at
risk of having children whose FMR1
gene is epigenetically silenced by
DNA methylation.
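
The repeat thresholds quoted above can be expressed as a short sketch. It counts the longest uninterrupted run of CGG triplets in a DNA string and compares it with the 55-to-200 premutation range given in the review. The function names, category labels, and sample sequence are hypothetical. Real repeat tracts can contain interruptions that this simple pattern ignores, and the review does not name the category above 200 repeats, so the code labels it only as lying above the premutation range.

```python
import re

PREMUTATION_MIN = 55   # lower bound quoted in the review
PREMUTATION_MAX = 200  # upper bound quoted in the review


def longest_cgg_run(sequence):
    """Return the largest number of consecutive CGG triplets in a DNA string."""
    runs = re.findall(r"(?:CGG)+", sequence.upper())
    return max((len(run) // 3 for run in runs), default=0)


def classify(repeats):
    """Place a repeat count relative to the premutation range (labels are illustrative)."""
    if repeats < PREMUTATION_MIN:
        return "below premutation range"
    if repeats <= PREMUTATION_MAX:
        return "premutation"
    return "above premutation range"


# Hypothetical sequence: 60 CGG repeats flanked by arbitrary bases.
sample = "AT" + "CGG" * 60 + "TT"
count = longest_cgg_run(sample)
print(count, classify(count))
```

Keeping the two bounds as named constants makes it easy to swap in whatever cutoffs a given laboratory or guideline uses.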

The Carriers focuses on the 
little-told story of those with the 
premutation, like Jessie and her father, who 
suffer from fragile X-associated primary 
ovarian insufficiency and tremor/ataxia 
syndrome, respectively. Skomorowsky deftly 
interweaves science and narrative, explain- 
ing how the variations in CGG repeats can 
lead to distinct, adult-onset disorders in dif- 
ferent sexes and ages or produce “a touch of 
fragile X syndrome” in some carriers. She
rightly points out that the difficulties suf-
fered by some carriers, especially those who
experience diminished executive function,
make taking care of a fragile X child even
more challenging.

Parker Roos, who has fragile X syndrome,
dyes eggs with his family in 2012.

The heroic efforts of astute researchers 
such as Randi Hagerman at the Medical 
Investigations for Neurodevelopmental 
Disorders (MIND) Institute at the 
University of California, Davis, who have 
sought to understand these conditions, read 
like a fast-paced detective story and have 
ramifications beyond medicine. Problems 
with behavior, aggression, and self-control, 
we are reminded, can stem from factors be- 
yond an individual’s control. Skomorowsky 
observes: “It wasn’t just one damn thing af- 
ter another. It was genes.” 

The chapter describing the so-called vil- 
lage of fools is among the more fascinating 
instances of the drastic impact that fragile 
X and its carriers can have on a community. 
The disproportionate amount of strange 
behavior and illness among the residents 
of Ricaurte, Colombia, is legendary, stretch- 
ing back to its foundation in the 1800s. 
The medical mystery underlying this phe- 
nomenon was recently solved by physician 
Wilmar Saldarriaga Gil, whose keen sense 
of observation was second only to his com- 
passionate curiosity. 

Some may be disappointed that 
Skomorowsky does not evaluate potential 
future treatments for genetic diseases like 
fragile X and its premutations, although 
gene therapy has been covered extensively 
elsewhere. More mechanistic details into 
the function of FMRP and a description of 
the attempts underway to compensate for 
its deficiency with experimental therapeu- 
tics would have also been welcomed. 

Two key takeaways emerge from 
Skomorowsky’s enlightening account. The 
first is that fragile X and its associated pre- 
mutations illustrate the important role that 
genomic analyses can play in understand- 
ing cognitive abilities and behavior, high- 
lighting, in particular, that variations in in- 
tergenic regions—often dismissed as “junk 
DNA”—should not be ignored.

Second, The Carriers underscores the 
value of being patient with patients—atten- 
tive listening to their symptoms and fam- 
ily history can reveal clues needed to solve 
clinical mysteries. In this sense, patients 
and doctors work like two sides of the same 
coin: “When doctors listen closely to their 
patients, with enough care, they are not 
just being kind. They are part of a process 
of discovery.”

NEURODEVELOPMENT


The becoming of the human brain 


A neuroscientist traces the development of the body’s most complex organ 


By R. Douglas Fields 


Nothing in biology is more miraculous
than the transformation of a fertilized
egg into a complete organism. Con-
tained within this process are all of
genetics, the core of cell biology, and
the imprints of evolution. The human
brain in particular—formed from a teeming 
ball of embryonic cells—transcends anatomy 
to generate identity, personality, and com- 
prehension, which make each new life differ- 
ent from all others. How the nervous system 
develops is the subject of neuroscientist Wil- 
liam A. Harris’s new book, Zero to Birth: How 
the Human Brain Is Built. 

Using an expository style, interspersed 
with first-person accounts of his own re- 
search, Harris provides an 
authoritative synopsis of de- 
velopmental neuroscience. 
The scope encompasses top- 
ics ranging from basic em- 
bryology to the influences 
of postnatal experience on 
brain development. 

For developmental neu- 
roscientists like myself, 
whose research careers 
stretch back to the 1980s, 
this accounting will serve 
as a delightful revisitation 
of the excitement surround- 
ing noteworthy discoveries 
as they were made and the 
people who made them. For 
students and others new 
to the field, the book will 
provide a comprehensive 
compendium of the most 
important aspects of de- 
velopmental neuroscience. 
Those new to the subject 
may find the proleptic be- 
ginning of the book a bit disorienting, but 
subsequent chapters follow a clear and or- 
derly structure. 

The book begins with the question of 
cell fate, opening with the classic experi- 
ments of Wilhelm Roux, Hans Driesch, and 
Hans Spemann, who worked to determine 
whether the twin cells formed by the first 
cleavage of a zygote would, if separated, pro-
duce hemiembryos or two complete twins
in frogs, sea urchins, and salamanders, re- 
spectively. The answer—twins—formed the 
basis of our understanding of totipotency. 
Cells of this stage are capable of making it 
all. But this raises a grander question: What 
events lead a certain population of embry- 
onic cells down the pathway to become the 
cells in our brains? 

The narrative then jumps to genetic in- 
heritance and single-cell organisms such as 
the paramecium to build an argument that 
“many properties that are characteristic of 
the brain were already encoded in the DNA 
of our single-cell ancestors.” A brief tour of 
comparative neuroanatomy—from jellyfish 
to humans—comes next. Then the story de-
tours to consider the fate of a single frog egg
after fertilization. The remaining portion of
the first chapter proceeds through the em-
bryological milestones of forming a blas-
tula, gastrulation, and the rise of the neural
plate, which furrows and folds into the neu-
ral tube—the forerunner of the brain and
spinal cord. Subsequent chapters discuss
how the general body plan is formed, the
proliferation and differentiation of neurons,
and the broader topics of neural plasticity
and human brain evolution.

Countless neural connections are refined as a baby’s brain develops.


Zero to Birth: How the 
Human Brain Is Built 
William A. Harris 

Princeton University Press, 
2022. 272 pp. 


At this point, Harris confronts a seem- 
ingly unfathomable question: How do the 
roughly 100 billion neurons in the human 
brain get wired up properly? The specific 
labeling of each connection is mathemati- 
cally impossible. Instead, the nervous sys- 
tem wires itself up by refining circuitry on 
the basis of performance. Countless connec- 
tions in a baby’s brain are winnowed away, 
its axons and dendrites pruned, and nubile 
neurons and nascent synapses that are less 
competitive in performance 
are eventually eliminated. 

The final chapter consid- 
ers the questions of what 
makes the human brain so 
different from all others and 
what makes your brain so 
different from every other 
human’s. In large measure, 
the answer is found at the 
intersection of biology and 
society. Our brains con- 
tinue to develop after birth, 
guided by the experiences 
we have in the environment 
we are born into. 

The book covers a broad 
scope, but readers must look 
elsewhere for up-to-date 
information on the most 
abundant and diverse type of 
cells in the brain: glia. These 
cells have a profound influ- 
ence on nearly every aspect 
of nervous system develop- 
ment, but they are referenced 
here only superficially. There is no mention of 
Schwann cells, for example, or of microglia. 
These and other glia help to guide neuronal 
development and prune synapses. In consid- 
ering what makes the human brain differ- 
ent, Harris’s omission of astrocytes, which in 
the human cerebral cortex are unlike those 
in any other animal, is another unfortunate 
oversight. On the subject of neuronal devel- 
opment, however, Zero to Birth is accessible, 
authoritative, and comprehensive.

Species in Himalayan headwater streams would be threatened by India’s proposed small hydropower projects.


Edited by Jennifer Sills 


India must protect 
Himalayan headwaters 


At the 26th UN Climate Change 
Conference of the Parties (COP26), India 
pledged to convert to 50% renewable 
energy use by 2030 (1). The country plans 
to achieve this goal by building 5747 new 
small hydropower projects, more than 
42% of which are planned in Indian 
Himalayan states (2). The 88 existing 
large dams (functional or under construc-
tion) in India’s Himalayan region have
already exacerbated habitat fragmenta- 
tion and put local species and ecosystems 
at risk (3, 4). Because the cumulative 
impacts of hundreds of small hydropower 
projects will likely lead to similar environ- 
mental damage, their construction must 
be reconsidered. 

The Indian Himalaya’s extraordinary 
vertebrate diversity (5) faces rapid devel- 
opment and associated habitat loss; 17.3% 
of the region’s fish and amphibians are 
already threatened (5). Headwater aquatic 
species can only tolerate a narrow range 
of cold temperatures, making them par- 
ticularly vulnerable to climate change. As 
temperatures rise, these species are likely 
to move upstream to find cooler habitats 
(6). The proliferation of small hydro- 
power projects will obstruct their path. 
Other species may try to move to higher 
elevations to escape invasive species; the 
proposed dams will block their access as
well (7). Moreover, the projects will dis- 
rupt the activities of apex predators that 
depend on headwaters for feeding, breed- 
ing, and migration (8, 9). 

High warming (10) and seismicity (11)
rates in the Himalaya make existing small 
hydropower infrastructures unstable. 
During the 2021 Chamoli disaster, for 
example, an avalanche washed out a small 
hydropower project and the resulting mas- 
sive debris flow left 200 people killed or 
missing, with an estimated loss of US$223 
million (12). Meanwhile, the Himalaya’s 
rich natural capital supports millions of 
livelihoods. The small hydropower projects, 
if built as proposed, would impair ecosys- 
tem services and economic growth. 

A policy overhaul, particularly for frag-
ile landscapes like the Himalaya, is urgently
needed. Renewable plans should focus on 
solar, wind, and biomass strategies instead 
of small hydropower projects. India has 
committed to developing without destroying 
the environment, enhancing climate resil- 
ience, and reducing vulnerability to climate 
change. Only by protecting the Himalaya 
from further fragmentation can the country 
live up to its promises. 

Addressing patent
gender disparities 


Women comprise 29% of the science, 
technology, engineering, and mathematics 
(STEM) workforce but only 12.8% of US 
patent inventors (1). Institutional struc- 
tures and biases contribute to the patent 
equity gap (2-4). To increase the number 
of women who hold patents, institutions 
must give everyone more information 
about patents, ensure that women receive 
equal credit for their work, and provide 
women with equitable access to work 
opportunities. 

Women are less likely than their male 
counterparts to pursue patents for their 
inventions because they are less familiar 
with the patent process and less likely 
to see themselves as “inventive” (4, 5). 
Institutions can address these gaps by 
creating inventor education programs that 
use equitable language (6). For example, 
to address differences in self-perception 
between men and women, institutions 
could refer to the paperwork required to 
internally disclose an invention and start 
the patent process as “progress reports” 
or “creativity disclosures” instead of the 
standard “inventor disclosure forms.” 
Institutions could also provide office 
hours and resources to guide new inven- 
tors of all genders through the process. 

In some cases, an inventor applies for 
a patent but unintentionally excludes 
women who collaborated on the project 
from the disclosure. Prepopulating dis- 
closure forms with the names of relevant 
past and present laboratory members can
help negate gender-related biases and
increase inventor disclosure compliance
(7, 8). If applicants are prompted to con-
sider all contributors, they are more likely
to include inventors whom they might
otherwise overlook.

science.org SCIENCE

PHOTO: AASHNA SHARMA

It is possible that women are less likely 
to develop technology eligible for patents 
because they have fewer opportunities 
to achieve their professional goals (4). 
Institutions must recognize and address 
evolving societal circumstances that 
disparately affect women. For example, 
institutions can create more equitable 
opportunities by providing affordable and 
accessible childcare to all employees and 
promoting policies to ensure that admin- 
istrative tasks are not delegated more fre- 
quently to women than to men (3, 4). 

Closing the patent equity gap will 
require data showing how many women 
and men are applying for patents and 
whether specific strategies successfully 
mitigate disparities. All corporate, gov- 
ernmental, and academic institutions 
should collect and make public data on 
the gender distribution of all employees 
compared with the gender representation 
of patent inventors. Gendered patent dis- 
parity should be quantified and tracked 
over time, and effective diversity, equity, 
and inclusion programs [e.g., (9)] should 
be implemented and monitored for suc- 
cess. By making these changes, institu- 
tions can wield their power to promote 
gender equity among patent applicants. 
All inventors deserve to be recognized for 
their creative work. 

Jordana Goodman 


Boston University School of Law, Boston, MA 
02215, USA. Email: jordi@bu.edu 

Advancing racial equity
in Brazil’s academia 


People in Brazil who identify as Black, 
including those with major African inheri- 
tance and those who belong to mixed-raced 
ethnic groups (1), have historically been
underrepresented in postsecondary educa- 
tion as a result of racial and socioeconomic 
inequities (2, 3). In Brazil, public universi- 
ties are free but highly competitive, favoring 
students from wealthier families who gener- 
ally had access to better, private secondary 
education (4). Black activism in Brazil has 
highlighted the inadequate access to edu- 
cation and stimulated debates on how to 
apply antiracist strategies to achieve racial 
equity (5). In 2013, the Labour Party estab-
lished race-conscious affirmative actions
in public universities, which provided aid
to low-income Black people, a change that
greatly increased the number of Black 
undergraduate and graduate students (6, 7). 
However, President Jair Bolsonaro’s opposi- 
tion to race-based affirmative action and 
hostility toward higher education jeopardize 
these successful measures (8, 9). 

The Bolsonaro administration’s budget 
cuts to science and education limit the 
funding available for programs that advance 
diversity in academia by subsidizing hous- 
ing, food, and transportation for low-income 
public university students, many of whom 
are Black. Given the success of affirmative 
action policies in improving racial equity in 
Brazilian universities (6, 7), such programs 
should be maintained and expanded rather 
than cut. If decision-makers do not reevalu- 
ate this budget, Black people and other 
minorities will likely be pushed out of aca- 
demia, and Brazil will lose a generation of 
bright, underrepresented scientists (9, 10). 

As Black scholars who lived through the 
paradigm shift toward racial equality in 
higher education in Brazil, we express our 
profound concerns about the dismantle- 
ment of diversity initiatives in postsecond- 
ary education. We also demand urgent 
attention from lawmakers to proposed 
legislation PL 1788/21, currently under 
consideration by the Chamber of Deputies, 
that would maintain racial quotas in higher 
education for 10 more years (11). It is urgent
to bring this discussion to light; without 
race-conscious and socioeconomic affirma- 
tive actions in the next few years, Brazil 
will likely face an unprecedented setback in 
racial equity in academia. 

Comment on “Cryoforged nanotwinned titanium
with ultrahigh strength and ductility” 


Yazhou Guo, Xiaolei Wu, Qiuming Wei 

We analyze the results of Zhao et al. (Report, 
17 September 2021, p. 1363) with a focus
on the mechanical properties and micro-
structural evolution. We conclude that their
results, together with the explanations and 
interpretations, are confusing, misleading, 
or even wrong. 

Full text: dx.doi.org/10.1126/science.abo3440 


Response to Comment on “Cryoforged nanotwinned 
titanium with ultrahigh strength and ductility” 


Shiteng Zhao, Ruopeng Zhang, Qin Yu, Jon Ell, 
Robert O. Ritchie, Andrew M. Minor 

We address the three main points of Guo
et al. They claim that we should have used
the engineering stress versus engineer-
ing strain curves to infer the mechanical
properties of our nanotwinned titanium,
question our sample design on the basis
of a finite-element analysis, and doubt the
immobility of some preexisting grain/twin 
boundaries in our electron backscatter dif- 
fraction micrographs. We find their analysis 
to be groundless and to contain many 
inconsistencies. 

Full text: dx.doi.org/10.1126/science.abo5247 

BIOMECHANICS
Following formidable flyers 


Flying involves complicated maneuvers, not the least of which
is the process of returning to an upright flying position after
being flipped over in midair. Wang et al. used a combination of
experiments and biophysical models to understand this pro-
cess in dragonflies, which are adept flyers. The authors found
that righting involves a series of signals beginning with the visual
system through to wing pitch muscles. This approach revealed
connections between neural signals and physical processes that
could be used to study flight mechanics across species. —SNV
Science, abg0946, this issue p. 754

Dragonflies use visual stimuli to coordinate righting reflexes while flying.


Emotional memory 
encoding in REM sleep 


During rapid eye movement 
(REM) sleep, emotional memo- 
ries are consolidated in the 
prefrontal cortex. However, we 
are still far from a mechanistic 
understanding of the processes 
involved. Aime et al. investigated 
the impact of REM sleep on
cellular and subcellular activi- 
ties, microcircuit connectivity, 
plasticity, and behavior. The 
authors quantified how somatic 
and dendritic activity differ in 
mice during wakefulness, REM 
sleep, and non-REM sleep, and 
probed how interneurons cause 
these differences. They further 
identified a specific area in the 
thalamus that activates a sub- 
population of interneurons that 
decouple the dendrite from the 
somatic output. Finally, by opto- 
genetic intervention during REM 
sleep, learning of discrimination 
between danger and safety could 
be enhanced or diminished.
These findings provide a better 
understanding of the processing 
of emotions during sleep. —PRS 
Science, abk2734, this issue p. 724 


Large-area tandem 
perovskite solar cells 


Tandem solar cells allow more of 
the solar spectrum to be used. 
For all-perovskite implementa- 
tions with large illumination 
areas, different bandgap com- 
positions must be grown with 
fully scalable methods. Xiao et 
al. blade coated high-quality, 
wide-bandgap perovskite layers 
by tuning the cesium concentra- 
tion in a mixed solvent system. 
They avoided diffusion between 
the perovskite layers with a tin 
oxide layer grown by atomic layer 
deposition that also served as 
an electron extractor. Small-area 
cells (1 square centimeter) had
a power conversion efficiency
(PCE) of about 25%, and a
20-square-centimeter module
had a certified PCE of 21.7%. The 
encapsulated tandem module 
maintained more than 75% of its 
initial PCE after maximum power 
point operation for over 500 
hours in ambient air. —PDS 
Science, abn7696, this issue p. 762 


Ion enhanced polarization


Hafnium oxide is an exciting
material because it has ferro-
electric behavior that makes
it attractive for various device
applications. Kang et al. found
that the ferroelectric properties
improve by bombarding films of
hafnium oxide with a beam of
helium ions. The ion bombard-
ment creates oxygen vacancies
and strain changes from helium
implantation that push more
of the polycrystalline samples
into the ferroelectric ortho-
rhombic phase. This method
may become an important tool
for stabilizing the ferroelectric
phase for the next generation of
electronic devices. —BG
Science, abk3195, this issue p. 731


Boron flips the script 
on cyclohexane 


Conformational analysis of 
disubstituted cyclohexanes 
played an outsized role in 
the development of organic 
chemistry. It is striking in that 
context that the comparatively 
less stable isomers remain chal- 
lenging to access. Li et al. report 
a versatile method to prepare 
1,2-cis, 1,3-trans, and 1,4-cis- 
disubstituted cyclohexanes, 
each of which requires a sub- 
stituent to adopt the disfavored 
axial orientation. The method 
relies on a bulky boron substitu- 
ent one carbon away from the 
ring to bias the approach of a 
nickel catalyst. —JSY 

Science, abn9124, this issue p. 749 


An extended history
of water on Mars


Early Mars was wet and relatively 
warm, but Mars is much colder 
today, and water there mostly 
occurs in the form of ice. When 
this transition occurred is the 
focus of much Mars research. 
Recent remote-sensing evidence 
has suggested that liquid water 
persisted into the past 2.9 billion 
years of Mars history. Liu et al. 
reinforce this argument using 
spectroscopic data obtained by 
China’s Zhurong rover. These data 
include the first surface detection 
of hydrous minerals in a sedimen- 
tary rock from the Utopia Planitia 
region of Mars. They also indicate 
the presence of liquid surface 
water during sediment deposition 
in the area, which likely occurred 
some time in the past billion 
years. —KVH 
Sci. Adv. 10.1126/ 
sciadv.abn8555 (2022). 


Global shifts 
in tidal wetlands 


Ecologically and economically 
important coastal wetlands 
are threatened by sea level rise 
and land use change. Murray 
et al. used high-resolution 
satellite imagery to assess the 
global extent of tidal wetlands 
and changes in wetland extent 
and distribution over the past 
two decades. They found that
although over 13,000 square
kilometers of tidal wetland have 
recently been lost, much of this 
decreasing extent has been 
offset by the creation of new 
wetlands. The greatest losses 
and gains were in tidal flats, but 
mangrove ecosystems showed 
the largest net decline in area 
globally. Direct human impacts 
on wetlands, including land 
transformation and restoration, 
are detectable from satellite 
imagery and account for 27% of 
wetland losses and gains. —BEL 
Science, abm9583, this issue p. 744 


Beneficial inflammation 
The mechanisms mediating the 
transition from acute to chronic 
pain are not well understood. 
Analyzing samples from patients 
and animal models, Parisien 
et al. found that neutrophil 
activation–dependent inflam-
matory genes were up-regulated
in subjects with resolved pain, 
whereas no changes were 
observed in patients with 
persistent pain. In rodents, 
anti-inflammatory treatments 
prolonged pain duration, and 
this effect was abolished by neu- 
trophil administration. Clinical 
data also showed that use of 
anti-inflammatory drugs was 
associated with increased risk of 
persistent pain, suggesting that 
anti-inflammatory treatments 
might have negative effects on 
pain duration. —MM
Sci. Transl. Med. 14, eabj9954 (2022). 


Tidal wetlands, such as this salt marsh in Schiermonnikoog, Netherlands, are dynamic habitats experiencing growth in some locations but greater net loss globally.


Neighborhood pollution discriminates


One of the strongest predictors of health in the United
States is zip (postal) code because social and environ- 
mental conditions can vary so much by location. The 
health status of poor and ethnic minority neighborhoods 
is not only compromised by a lack of resources but also 
by higher air pollution levels in immediate surroundings and 
adjacent neighborhoods. However, it is unclear whether pollution 
exposure changes when people travel to nonadjacent neighbor- 
hoods for employment, errands, or social engagements. Brazil 
used mobile phone location data collected across the 100 larg- 
est US cities and found that Hispanic neighborhoods (whether 
poor or more affluent) were exposed to the highest pollution 
levels and residents tended to travel to nonadjacent regions also 
experiencing high pollution. By contrast, more affluent, predomi- 
nantly white neighborhoods were least exposed to air pollution. 
Therefore, minorities can carry a previously unmeasured 
additional burden of health disadvantage across their social net- 
works. —EEU Proc. Natl. Acad. Sci. U.S.A. 119, e2117776119 (2022). 


Residents of poor and ethnic minority neighborhoods tend to be exposed 
to unacceptable levels of air pollution at all points of their social networks. 


A microbial
oncometabolite


The gut microbiota have been 
associated with various types of 
cancer. For example, particular 
bacterial species may contribute 
to the initiation and progression
of colorectal cancer (CRC).
Fusobacterium nucleatum has
been identified as a pro-CRC
bacterium, but the underlying
mechanism is unknown. Ternes
et al. confirmed that F. nucleatum
was abundant in stool samples
from 63 patients with CRC. In
coculture experiments, they
found that F. nucleatum altered
the signaling and metabolism of
patient-derived CRC cells, and this
was caused by the increased
secretion of formate by F. nuclea-
tum. Furthermore, administration
of formate increased CRC tumor
size and metastasis in vivo,
indicating that formate is an
oncometabolite that drives CRC
progression. —GKA
Nat. Metab. 4, 458 (2022).


ANTHROPOLOGY
Human resilience to climate change


It is difficult to determine how past human populations coped
with climate change. Cultural transformations and climate
events have certainly been correlated by historians. Plunkett and
Swindles challenge the view that human societies are necessarily
vulnerable to collapse in the face of climate change. A marginal
(cool “mossy bog”) upland landscape in Northern Ireland has
been continuously occupied since the Middle Ages despite climate
oscillation. A peat core sample supplied detailed high-resolution
climate proxy data from volcanic ash, fossil, and pollen deposits.
These data, coupled with records of plague, war, and famines, show
that none of these events interrupted the mixed agricultural subsis-
tence economy of this society for long. Either the population was
sufficiently isolated to escape the consequences of these events or
robust social organization allowed them to recover. Either way, this
detailed record from western Europe provides an indication of the
adaptability of some human social organizations during environ-
mental stress. —CA PLOS ONE 17, e0266680 (2022).


Despite climate oscillations, wars, and famines, the peat uplands of Slieveanorra, Northern Ireland, have been continuously occupied by humans for over 1000 years.


SYNTHETIC BIOLOGY 
Bacteria engineered 
to protect the gut 


Although treatment with broad- 
spectrum antibiotics provides 
invaluable protection from 
infections, it can also disrupt
microbial communities in the 
gut. Cubillos-Ruiz et al. propose a 
strategy in which treated animals 
are fed bacteria that produce and 
secrete an enzyme to degrade 
β-lactam antibiotics in the gut.
This tactic allows the antibiotic 
to reach effective concentra- 
tions in the serum but decreases 
disruption of the gut microbiota. 
Critically, the engineered bacteria 
are not themselves resistant
to the antibiotic, and the two-gene
biosynthetic strategy used is
unlikely to allow transfer of resis-
tance. —LBR
Nat. Biomed. Eng. 10.1038/
s41551-022-00871-9 (2022).


PHYSICS 
Thermal Hall effect 
and phonons 


The material α-RuCl₃ has gar-
nered much interest as a potential
realization of the so-called
Kitaev spin liquid. A key piece
of evidence has come from the
observation of half-quantization
of the thermal Hall conductiv-
ity, which is consistent with the
predictions of the Kitaev model.
However, these measurements
show strong sample dependence,
which complicates interpreta-
tion. Lefrançois et al. measured
thermal Hall conductivity as
well as longitudinal thermal
conductivity in five different
samples of α-RuCl₃. Based on a
striking similarity between the
temperature dependence of the
phonon-dominated longitudinal
thermal conductivity and thermal
Hall conductivity, the research-
ers concluded that the thermal
Hall effect likely has a large
contribution from phonons, which
should be taken into account
when interpreting similar experi-
ments. —JS
Phys. Rev. X 12, 021025 (2022).


QUANTUM NETWORKS 
A dynamic
quantum memory


Because they are robust and
fast, photons are ideal carriers
of quantum information in long-
distance quantum networks.
For a practical communication
platform, the photons also
need to be manipulated at the
local level at nodes or exchange
points across the network,
requiring the information carried
by the photonic qubits to be
reliably encoded, stored, and
retrieved. Craiciu et
al. demonstrate such func-
tionality using an ensemble of
erbium ions embedded within
a solid-state matrix of yttrium
orthosilicate. With the ions
operating as a dynamic quantum
memory for photons at com-
munication wavelengths and the
approach being compatible with
integrated photonics, the results
are promising for the possible
integration into large-scale
optical fiber networks. —ISO
Optica 8, 114 (2021).


MARINE HEATWAVES 
The long and the 
short of it 


Continued global warming has 
increased the frequency and 
severity of marine heatwaves. 
These events, of finite duration 
like their atmospheric counter-
parts, can affect both the physical
and biological environments of
the regions in which they occur.
Mogen et al. studied the first year
of the Northeast Pacific marine
heat wave of 2013 to 2016, popu-
larly referred to as the “Blob,” and
found that although it temporarily 
mitigated the effects of ocean 
acidification, because of the 
changes that it induced in ocean 
circulation and mixing, it also 
reduced ocean oxygen content 
through warming. These results 
are early steps toward a better 
understanding of the potential 
impacts of multiple stressors that 
may occur during marine heat 
waves. —HJS 


Geophys. Res. Lett. 
10.1029/2021GL096938 (2022).


HUMAN CELL ATLAS
Mapping cell types 
in the human body 


The Tabula Sapiens is a 
molecular reference atlas for 
more than 400 cell types of the 
human body. The Tabula Sapiens 
Consortium used single-cell 
transcriptomics to measure the 
messenger RNA molecules in 
each of nearly 500,000 cells 
from 24 tissues and organs 
(see the Perspective by Liu and 
Zhang). These data enable new 
insights into how the human 
genome parts list is used to 
create distinct cell types within 
the human organism. In addition 
to creating a detailed molecular 
definition of these cell types, the 
atlas reveals many other aspects 
of human biology, including how 
the same gene can be spliced 
differently in different cell types, 
how shared cell types in different 
tissues can have subtle differ- 
ences in their identities, and how 
clones of the immune system 
can be shared across tissues.
—LMZ and DJ
Science, abl4896, this issue p. 711;
see also abq2116, p. 695


HUMAN CELL ATLAS 
Cartography 
of human cells 


The function of disease genes 
active in different cell types is 
modulated to meet the needs 
of the different tissues and 
organs in which the cells reside. 
Resolving these differences is 
critical to understanding homeo- 
stasis and disease. However, 
single-cell atlases generated 
to date have largely focused on 
individual tissues. Eraslan et 
al. applied single-nucleus RNA 
sequencing to frozen, banked 
samples from eight healthy 
human organs from 16 donors 
and characterized cell popula- 
tions across tissues, including 
tissue-resident myeloid and 
fibroblast populations, and 
their role in tissue support and 
immunity (see the Perspective 
by Liu and Zhang). Using this
cross-tissue atlas, the authors 
linked specific cell populations 
to monogenic and polygenic 
diseases, suggesting cell- and 
tissue-specific programs.
—LMZ and DJ
Science, abl4290, this issue p. 712;
see also abq2116, p. 695


HUMAN CELL ATLAS 
Immune cell diversity in 
the human body 


The human immune system is 
composed of many different cell 
types spread across the entire 
body, but little is currently known 
about the fine-grained variations 
in these cell types across organs. 
Using single-cell genomics, 
Domínguez Conde et al. exam-
ined the gene expression profile
of more than 300,000 individual 
immune cells extracted from 16 
different tissues in 12 deceased 
adult organ donors (see the 
Perspective by Liu and Zhang). 
Cell identity was assigned using 
CellTypist, an automated cell 
classification tool designed 
by the authors. In-depth data 
analysis revealed insights into 
how the immune system adapts 
to function effectively in different 
organ contexts. —LMZ and DJ 
Science, abl5197, this issue p. 713;
see also abq2116, p. 695


MICROBIOTA 
Gut bacteria go 
on the record 


Bacteria in the gut dynamically
respond to one another, as
well as to their host's diet and
immune system. However, cur-
rent approaches to measuring
changes in gene expression in
this setting have been limited
and are generally invasive.
Schmidt et al. use a CRISPR-
based recording method
(Record-seq) to capture the
transcriptional changes that
occur in Escherichia coli bac-
teria as they pass through the
intestines (see the Perspective
by Zahavi and Segal). Deep
sequencing of these DNA record-
ings revealed characteristic
signatures based on dietary,
host, and microbiota contents at
different levels of the intestine.
This work provides additional
perspectives on how diet,
inflammation, and microbial
interactions in the body shape
the health of mammalian hosts.
—STS
Science, abm6038, this issue p. 714;
see also abq1455, p. 697


NANOCHEMISTRY 
Fluorine for faster 
water transport 


Typically, as the diameter of a 
pipe or channel is shrunk, the 
flow rate per cross-sectional 
area will decrease because of the 
greater surface area to volume 
ratio. Itoh et al. designed a series 
of fluorous oligoamide nanorings 
that undergo supramolecular 
polymerization to form nano- 
channels with dense packing 
of organofluorine groups (see 
the Perspective by Shen). 
The intense electronegativity 
disrupts the formation of water 
clusters, so that individual water 
molecules flow through the 
smallest channels faster than 
through larger ones. Chloride 
ions are strongly repelled and 
cannot travel through the chan-
nels. —MSL
Science, abd0966, this issue p. 738;
see also abo2953, p. 698


FOREST ECOLOGY 
Uncoupled carbon 
uptake and storage 


Forests are expected to help 
mitigate climate change by 
sequestering carbon, with 
elevated carbon dioxide 
boosting photosynthesis and 
carbon uptake. However, the 
amount of carbon that can be 
stored in wood also depends
on temperature, water, and
nutrient availability. Cabon et al.
examined temporal correlations
between trees’ carbon uptake
and woody growth by com- 
bining data on tree rings and 
gross primary productivity 
measures from 78 forests with 
carbon dioxide flux towers (see 
the Perspective by Green and 
Keenan). They found weak corre- 
lations between productivity and 
woody growth, which responded 
differently to seasonal tempera- 
tures and water availability. Their 
work shows that limits to tree 
growth, particularly in dry and 
cold areas, may constrain poten- 
tial forest carbon storage. —BEL 
Science, abm4875, this issue p. 758; 
see also abo6547, p. 692


VACCINES 


MAdCAM-1 about aging 


As people age, their immune 
responses to vaccines and 
infections diminish, increasing 
the risk for serious illness. These 
diminished immune responses 
have been tied to poor germi- 
nal center responses in older 
people, yet the exact mechanism 
for how this effect occurs is 
unclear. Denton et al. vaccinated 
adult or aged mice with a model 
immunogenic protein to track 
germinal center responses 
over time. They found that the
microenvironment of the lymph 
node was impaired in older mice, 
as defined by a poor response 
of MAdCAM-1⁺ stromal cells to
vaccination. These cells were 
partially rescued with a Toll-like 
receptor 4 (TLR4) agonist, which 
improved the initiation of germi- 
nal center immune responses. 
Together, these data suggest 
that targeting TLR4 might 
improve the efficacy of vaccina-
tion in older people. —DAE
Sci. Immunol. 7, eabk0018 (2022).


NEUROSCIENCE 
Bridging adhesion 
signaling for learning 


Biliverdin reductase (BVR) 
and its product, bilirubin, have 
antioxidant functions in the 
brain. Vasavda et al. found an 
additional synaptic role for BVR
that was independent of either 
bilirubin or biliverdin. BVR linked 
focal adhesion kinases to an 
effector kinase at synapses
in the mouse hippocampus,
which enabled signaling 
between these kinases and 
synaptic plasticity, a process 
that underlies learning in which 
synaptic strength is calibrated 
in response to changing inputs. 
Mice lacking BVR had impaired 
focal adhesion signaling and 
showed performance deficits on 
cognitive tests. —LKF 

Sci. Signal. 15, abh3066 (2022). 


BATTERIES 
Theory-guided 
materials research 


A reliable energy storage
ecosystem with high energy
density and long lifetime
storage is imperative for a
renewable energy future.
However, the further develop-
ment of rechargeable batteries
requires a better theoretical and
experimental understanding of
electrochemical mechanisms
and structure-property relation-
ships. In a Review, Eng et al.
discuss the interplay between
theory and experiment in battery
materials research, the combina-
tion of which will contribute to
new material predictions and
investigations of mechanisms.
The authors focus on specific
examples in state-of-the-art
lithium-ion, lithium-metal,
sodium-metal, and all-solid-state
batteries. This theory-experiment
framework can also
be extended to multivalent
batteries. Interdisciplinary col-
laboration is expected to provide
universal principles to develop
next-generation rechargeable
batteries. —XWDL
Sci. Adv. 10.1126/ 
sciadv.abm2422 (2022). 


NEUROIMMUNOLOGY 
T cells in the elderly
touch a nerve


With age, the ability to bounce
back from nervous system
injuries becomes increasingly
limited. However, the molecu-
lar and cellular mechanisms
underlying aging-dependent
regenerative decline remain
poorly understood. Zhou et
al. used RNA sequencing and
imaging to study dorsal root
ganglia in a mouse model of
sciatic nerve injury (see the
Perspective by Gaudenzio and
Liblau). Compared with younger
mice, injured DRG sensory
neurons in older mice expressed
higher levels of the chemo-
kine CXCL13, which attracted
CXCR5⁺CD8⁺ T cells. T cell–neu-
ron cross-talk activated caspase
3, which impaired pAKT and
pS6 signaling, causing regenera-
tive failure. When caspase 3,
CXCR5, or CXCL13 was thera-
peutically targeted, the authors
were able to restore neuronal
regeneration and recovery in
older mice to levels comparable
to their younger counterparts.
—STS
Science, abd5926, this issue p. 715;
see also abp9878, p. 694


QUANTUM SIMULATION 
Hydrodynamics 
of spin chains 


Computing the dynamics of 
quantum many-body sys- 
tems is notoriously difficult. 
Nevertheless, at long times after 
a quench, even these compli- 
cated systems are predicted to 
feature hydrodynamic trans- 
port properties. Two groups 
have used widely tunable and 
exquisitely controllable atomic 
systems to study the dynamics 
of spin propagation in quan- 
tum magnetic chains (see the 
Perspective by Morningstar
and Bakr). Joshi et al. varied the 
range of interactions in a chain 
of calcium ions and observed 
emergent hydrodynamics in 
several different universality 
classes. Wei et al. used a quan- 
tum gas microscope to monitor 
spin transport in a Heisenberg 
chain of cold atoms and found 
that the system exhibited the 
so-called Kardar-Parisi-Zhang 
superdiffusion. —JS 
Science, abk2397, abk2400, 
this issue p. 716, p. 720; 
see also abn6376, p. 699


HUMAN CELL ATLAS


The Tabula Sapiens: A multiple-organ, single-cell 
transcriptomic atlas of humans 


The Tabula Sapiens Consortium* 


INTRODUCTION: Although the genome is often 
called the blueprint of an organism, it is per- 
haps more accurate to describe it as a parts list 
composed of the various genes that may or 
may not be used in the different cell types of a 
multicellular organism. Although nearly every 
cell in the body has essentially the same ge- 
nome, each cell type makes different use of that 
genome and expresses a subset of all possible 
genes. This has motivated efforts to character- 
ize the molecular composition of various cell 
types within humans and multiple model or- 
ganisms, both by transcriptional and proteomic 
approaches. We created a human reference atlas 
comprising nearly 500,000 cells from 24 differ- 
ent tissues and organs, many from the same 
donor. This atlas enabled molecular character- 
ization of more than 400 cell types, their dis-
tribution across tissues, and tissue-specific
variation in gene expression.


RATIONALE: One caveat to current approaches to 
make cell atlases is that individual organs are 
often collected at different locations, collected 
from different donors, and processed using 
different protocols. Controlled comparisons of 
cell types between different tissues and organs 
are especially difficult when donors differ in 
genetic background, age, environmental expo- 
sure, and epigenetic effects. To address this, 
we developed an approach to analyzing large 
numbers of organs from the same individual. 


[Figure panels: Microbiome; Single-cell transcriptomics; Histopathology]
Overview of Tabula Sapiens. Molecular characterization of cell types using single-cell transcriptome sequencing
is revolutionizing cell biology and enabling new insights into the physiology of human organs. We created a
human reference atlas comprising nearly 500,000 cells from 24 different tissues and organs, many from the same
donor. This multimodal atlas enabled molecular characterization of more than 400 cell types.


RESULTS: We collected multiple tissues from
individual human donors and performed co-
ordinated single-cell transcriptome analyses on
live cells. The donors come from a range of eth-
nicities, are balanced by gender, have a mean
age of 51 years, and have a variety of medical 
backgrounds. Tissue experts used a defined 
cell ontology terminology to annotate cell types 
consistently across the different tissues, leading 
to a total of 475 distinct cell types with refer- 
ence transcriptome profiles. The full dataset 
can be explored online with the cellxgene tool. 
Data were collected for the bladder, blood, 
bone marrow, eye, fat, heart, kidney, large intes- 
tine, liver, lung, lymph node, mammary, muscle, 
pancreas, prostate, salivary gland, skin, small 
intestine, spleen, thymus, tongue, trachea, uterus, 
and vasculature. Fifty-nine separate specimens 
in total were collected, processed, and analyzed, 
and 483,152 cells passed quality control filtering. 
On a per-compartment basis, the dataset in- 
cludes 264,824 immune cells, 104,148 epithelial 
cells, 31,691 endothelial cells, and 82,478 stro- 
mal cells. Working with live cells, as opposed 
to isolated nuclei, ensured that the dataset in- 
cludes all mRNA transcripts within the cell, in- 
cluding transcripts that have been processed by 
the cell’s splicing machinery, thereby enabling 
insight into variation in alternative splicing. 
The Tabula Sapiens also provided an oppor- 
tunity to densely and directly sample the 
human microbiome throughout the gastro- 
intestinal tract. The intestines from two donors 
were sectioned into five regions: the duo- 
denum, jejunum, ileum, and ascending and 
sigmoid colon. Each section was transected, 
and three to nine samples were collected from 
each location, followed by amplification and 
sequencing of the 16S ribosomal RNA gene. 


CONCLUSION: The Tabula Sapiens has revealed 
discoveries relating to shared behavior and 
subtle, organ-specific differences across cell 
types. We found T cell clones shared between 
organs and characterized organ-dependent 
hypermutation rates among B cells. Endothelial 
cells and macrophages are shared across tis- 
sues, often showing subtle but clear differences 
in gene expression. We found an unexpectedly 
large and diverse amount of cell type-specific 
RNA splice variant usage and discovered and 
validated many previously undefined splices. 
The intestinal microbiome was revealed to 
have nonuniform species distributions down 
to the 3-inch (7.62-cm) length scale. These 
are but a few examples of how the Tabula 
Sapiens represents a broadly useful reference 
to deeply understand and explore human 
biology at cellular resolution.


The complete list of authors and their affiliations is available in 
the full article online. 

*Corresponding author: Stephen R. Quake (steve@quake-lab.org) 
HUMAN CELL ATLAS


The Tabula Sapiens: A multiple-organ, single-cell 
transcriptomic atlas of humans 


The Tabula Sapiens Consortium* 


Molecular characterization of cell types using single-cell transcriptome sequencing is revolutionizing 
cell biology and enabling new insights into the physiology of human organs. We created a human 
reference atlas comprising nearly 500,000 cells from 24 different tissues and organs, many from the 
same donor. This atlas enabled molecular characterization of more than 400 cell types, their distribution 
across tissues, and tissue-specific variation in gene expression. Using multiple tissues from a single 
donor enabled identification of the clonal distribution of T cells between tissues, identification of the 
tissue-specific mutation rate in B cells, and analysis of the cell cycle state and proliferative potential of 
shared cell types across tissues. Cell type-specific RNA splicing was discovered and analyzed across
tissues within an individual.


Although the genome is often called the
blueprint of an organism, it is perhaps
more accurate to describe it as a parts
list composed of the various genes that
may or may not be used in the different
cell types of a multicellular organism. Al- 
though nearly every cell in the body has es- 
sentially the same genome, each cell type makes 
different use of that genome and expresses a 
subset of all possible genes (1). Therefore, the
genome in and of itself does not provide an 
understanding of the molecular complexity 
of the various cell types of that organism. This 
has motivated efforts to characterize the mo- 
lecular composition of various cell types within 
humans and multiple model organisms, both 
by transcriptional (2) and proteomic (3, 4) 
approaches. 

Although such efforts are yielding insights 
(5-7), one caveat to current approaches is that 
individual organs are often collected at differ- 
ent locations or from different donors (8), are 
processed using different protocols, or lack 
replicate data (9). Controlled comparisons of 
cell types between different tissues and organs 
are especially difficult when donors differ in 
genetic background, age, environmental expo- 
sure, and epigenetic effects. To address this, 
we developed an approach to analyze large 
numbers of organs from the same individual 
(10), which we originally used to characterize 
age-related changes in gene expression in var-
ious cell types in the mouse (11).


All authors and their affiliations appear at the end of this paper.
*Corresponding author: Stephen R. Quake (steve@quake-lab.org)
Tabula Sapiens Consortium, Science 376, eabl4896 (2022)


Data collection and cell type representation


We collected multiple tissues from individual
human donors (designated TSP1 to TSP15) and
performed coordinated single-cell transcrip-
tome analysis on live cells (12). We collected
17 tissues from one donor, 14 tissues from a
second donor, and five tissues from two other 
donors (Fig. 1). We also collected smaller num- 
bers of tissues from a further 11 donors, creating 
biological replicates for nearly all tissues. The 
donors come from a range of ethnicities, are 
balanced by gender, have a mean age of 51 years, 
and have a variety of medical backgrounds 
(table S1). Single-cell transcriptome sequencing 
was performed with both fluorescence-activated 
cell sorting (FACS)-sorted cells in well plates 
with smart-seq2 amplification as well as 10x 
microfluidic droplet capture and amplification 
for each tissue (fig. S1). Tissue experts used a 
defined cell ontology terminology to annotate 
cell types consistently across the different tis- 
sues (13), leading to a total of 475 distinct cell 
types with reference transcriptome profiles 
(tables S2 and S3). The full dataset can be ex- 
plored online with the cellxgene tool through 
the Tabula Sapiens data portal (14).

Data were collected for the bladder, blood, 
bone marrow, eye, fat, heart, kidney, large in- 
testine, liver, lung, lymph node, mammary, 
muscle, pancreas, prostate, salivary gland, 
skin, small intestine, spleen, thymus, tongue, 
trachea, uterus, and vasculature. Fifty-nine 
separate specimens in total were collected, 
processed, and analyzed, and 483,152 cells 
passed quality control (QC) filtering (figs. S2 to 
S7 and table S2). On a per-compartment basis, 
the dataset includes 264,824 immune cells, 
104,148 epithelial cells, 31,691 endothelial 
cells, and 82,478 stromal cells. Working with 
live cells as opposed to isolated nuclei ensured 
that the dataset includes all mRNA transcripts 
within the cell, including transcripts that have 
been processed by the cell’s splicing machin- 
ery, thereby enabling insights into variations 
in alternative splicing. 
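
The per-compartment counts quoted above can be checked with a little arithmetic. The following is a minimal Python sketch, not part of the consortium's analysis: the variable and function names are illustrative assumptions, and only the five numbers are taken from the text. It computes each compartment's share of the QC-passing cells and shows that the four compartments sum to slightly less than the reported total; the text does not say how the remaining cells are classified.

```python
# Per-compartment cell counts as reported in the text (names are illustrative).
compartment_counts = {
    "immune": 264_824,
    "epithelial": 104_148,
    "endothelial": 31_691,
    "stromal": 82_478,
}
total_qc_cells = 483_152  # cells that passed quality control, as reported


def compartment_fractions(counts, total):
    """Return each compartment's share of the total cell count."""
    return {name: n / total for name, n in counts.items()}


fractions = compartment_fractions(compartment_counts, total_qc_cells)

# Cells included in the total but not in any of the four listed compartments.
unassigned = total_qc_cells - sum(compartment_counts.values())

for name, frac in fractions.items():
    print(f"{name:12s} {frac:6.1%}")
print(f"cells outside the four compartments: {unassigned}")
```

On these numbers, immune cells make up a little over half of the dataset, which is worth keeping in mind when comparing sequencing-derived abundances with other estimates.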

To characterize the relationship between 
transcriptome data and conventional histologic 
analysis, a team of pathologists analyzed hem-
atoxylin and eosin (H&E)-stained sections pre-
pared from nine tissues from donor TSP2 and
13 tissues from donor TSP14 (14). Cells were
identified by morphology and classified broad- 
ly into epithelial, endothelial, immune, and 
stromal compartments as well as rarely de- 
tected peripheral nervous system (PNS) cell 
types (Fig. 2A). These classifications were
used to estimate the relative abundances of 
cell types across the four compartments and 
to estimate the uncertainties in these abun- 
dances resulting from spatial heterogeneity 
of each tissue type (Fig. 2B and fig. S8). We 
compared the histologically determined abun- 
dances with those obtained by single-cell se- 
quencing (fig. S9). Although, as expected, there 
can be substantial variation between the abun- 
dances determined by these methods, in ag- 
gregate, we observed broad concordance over 
a large range of tissues and relative abun- 
dances. This approach enables an estimate 
of true cell type proportions because not every 
cell type survives dissociation with equal effi- 
ciency (15). For several of the tissues, we also 
performed literature searches and collected 
tables of prior knowledge of cell type identity 
and abundance within those tissues (table S4). 
We compared literature values with our ex- 
perimentally observed frequencies for three 
well-annotated tissues: the lung, muscle, and 
bladder (fig. S10). 
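
The comparison of compartment abundances described above can be illustrated with a minimal sketch. It assumes each cell simply carries one of the four compartment labels; the function names and the toy labels are invented for illustration and are not the authors' pipeline or data.

```python
from collections import Counter

COMPARTMENTS = ("epithelial", "endothelial", "immune", "stromal")


def relative_abundance(labels):
    """Return the percentage of cells assigned to each compartment."""
    counts = Counter(label for label in labels if label in COMPARTMENTS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no cells carry a known compartment label")
    return {c: 100.0 * counts[c] / total for c in COMPARTMENTS}


def largest_discrepancy(a, b):
    """Largest absolute difference, in percentage points, between two estimates."""
    return max(abs(a[c] - b[c]) for c in COMPARTMENTS)


# Invented toy labels, not values from the study.
histology = (["epithelial"] * 50 + ["immune"] * 30
             + ["stromal"] * 15 + ["endothelial"] * 5)
sequencing = (["epithelial"] * 40 + ["immune"] * 45
              + ["stromal"] * 10 + ["endothelial"] * 5)

hist_pct = relative_abundance(histology)
seq_pct = relative_abundance(sequencing)
print(hist_pct)
print(seq_pct)
print(largest_discrepancy(hist_pct, seq_pct))
```

A per-compartment difference of this kind is one simple way to flag tissues in which dissociation may have depleted a compartment relative to histology.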


Immune cells: Variation in gene expression 
across tissues and a shared lineage history 


The Tabula Sapiens can be used to study dif- 
ferences in the gene expression programs and 
lineage histories of cell types that are shared 
across tissues. We analyzed tissue differences 
in the 36,475 macrophages distributed among 
20 tissues because tissue-resident macrophages 
are known to carry out specialized functions 
(16). These shared and orthogonal signatures 
are summarized in a correlation map (fig. S11A). 
For example, macrophages in the spleen were 
different from most other macrophages, and 
this was driven largely by higher expression of 
CD5L, a regulator of lipid synthesis (fig. S11B). 
We also observed a shared signature of ele- 
vated epiregulin (EREG) expression in solid tis- 
sues, such as the skin, uterus, and mammary, 
compared with circulatory tissues (fig. S11B). 
We characterized lineage relationships be- 
tween T cells by assembling the T cell receptor 
sequences from donor TSP2. Multiple T cell 
lineages were distributed across various tis- 
sues in the body, and we mapped their rela- 
tionships (Fig. 3A). Large clones often reside in 
multiple organs, and several clones of mucosal- 
associated invariant T cells are shared across 
donors (fig. S11C); these cells had 
characteristic expression of TRAV1-2 because 
they are thought to be innate-like effector 
cells (17). 

Fig. 1. Overview of Tabula Sapiens. The Tabula Sapiens was constructed with data from 15 human donors; 
for detailed information on which tissues were examined for each donor, please refer to table S2. 
Demographic and clinical information about each donor is listed in the supplementary materials and methods 
and in table S1. Donors 1, 2, 7, and 14 contributed the largest number of tissues each, and the number of 
cells from each tissue is indicated by the size of each circle. Tissue contributions from additional donors 
who contributed single or small numbers of tissues are shown in the additional 11 donors column, and the 
total number of cells for each organ are shown in the final column on the right. 

Lineage information can also reveal 
tissue-specific somatic hypermutation rates in 
B cells. We assembled the B cell receptor 
sequences from donor TSP2 and inferred the 
germline ancestor of each cell. The mutational 
load varies markedly by tissue of residence, 
with blood having the lowest mutational load 
compared with solid tissues (fig. S11D). Solid 
tissues have an order of magnitude more 
mutations per nucleotide (mean = 0.076; 
SD = 0.026) compared with the blood 
(mean = 0.0069), which suggests that the immune 
infiltrates of solid tissues are dominated by 
mature B cells. 
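
As an illustration of what "mutations per nucleotide" means, the following minimal sketch counts mismatches between an inferred germline sequence and an observed receptor sequence. It assumes the two sequences are already aligned to equal length; the function name and the toy sequences are hypothetical, and the authors' actual assembly and germline-inference steps are not reproduced here.

```python
def mutations_per_nucleotide(germline, observed):
    """Fraction of compared positions at which observed differs from germline.

    Positions holding a gap ('-') or an ambiguous base ('N') in either
    sequence are skipped.
    """
    if len(germline) != len(observed):
        raise ValueError("sequences must be aligned to the same length")
    compared = 0
    mismatches = 0
    for g, o in zip(germline.upper(), observed.upper()):
        if g in "-N" or o in "-N":
            continue
        compared += 1
        if g != o:
            mismatches += 1
    if compared == 0:
        raise ValueError("no comparable positions")
    return mismatches / compared


# Invented toy sequences: 2 mismatches over 10 compared positions.
load = mutations_per_nucleotide("ACGTACGTAC", "ACGTTCGTAA")
print(load)

# Ratio of the mean loads quoted in the text (solid tissues vs. blood).
fold = 0.076 / 0.0069
print(round(fold, 1))
```

The ratio of the two quoted means is roughly 11, consistent with the "order of magnitude" description.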

B cells also undergo class-switch recombina- 
tion that diversifies the humoral immune re- 
sponse by using constant region genes with 
distinct roles in immunity. We classified every 
B cell in the dataset as immunoglobulin A 
(IgA)-, IgG-, or IgM-expressing and then cal- 
culated the relative amounts of each cellular 
isotype in each tissue (Fig. 3B and table S5). 
Secretory IgA is known to interact with 
pathogens and commensals at the mucosae, IgG is 
often involved in direct neutralization of path- 
ogens, and IgM is typically expressed in naive 
B cells or is secreted in the first response to 
pathogens. Consistent with this, our analysis 
revealed opposing gradients of prevalence of 
IgA- and IgM-expressing B cells across the tis- 
sues, with blood having the lowest relative 
abundance of IgA-producing cells and the large 
intestine having the highest relative abundance 
(and the converse for IgM-expressing B cells) 
(Fig. 3B). 
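
The per-tissue isotype proportions can be sketched as follows. This is a minimal illustration assuming each B cell has already been assigned a tissue and one of the three isotypes; the function name and the toy cell list are invented and do not reproduce the values in table S5.

```python
from collections import Counter, defaultdict

ISOTYPES = ("IgA", "IgG", "IgM")


def isotype_fractions(cells):
    """Map tissue -> {isotype: fraction of that tissue's classified B cells}."""
    counts = defaultdict(Counter)
    for tissue, isotype in cells:
        if isotype in ISOTYPES:
            counts[tissue][isotype] += 1
    fractions = {}
    for tissue, c in counts.items():
        total = sum(c.values())
        fractions[tissue] = {i: c[i] / total for i in ISOTYPES}
    return fractions


# Invented toy data, one (tissue, isotype) pair per B cell.
cells = (
    [("blood", "IgM")] * 8 + [("blood", "IgG")] + [("blood", "IgA")]
    + [("large intestine", "IgA")] * 8
    + [("large intestine", "IgG")] + [("large intestine", "IgM")]
)

fractions = isotype_fractions(cells)
# Order tissues by decreasing IgA fraction, as in the figure.
order = sorted(fractions, key=lambda t: fractions[t]["IgA"], reverse=True)
print(order)
```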


Endothelial cell subtypes with tissue-specific 
gene expression programs 


As another example of analyzing shared cell 
types across organs, we focused on endothelial 
cells (ECs). Although ECs are often categorized 
as a single cell type, they exhibit differences in 
morphology, structure, and immunomodulatory 
and metabolic phenotypes depending on 
their tissue of origin. We discovered that tissue 
specificity is also reflected in their 
transcriptomes because ECs mainly cluster by 
tissue of origin (table S6). Uniform manifold 
approximation and projection (UMAP) analysis 
(fig. S12A) revealed that the lung, heart, uterus, 
liver, pancreas, fat, and muscle ECs exhibited 
the most-distinct transcriptional signatures, 
reflecting their highly specialized roles. These 
distributions were conserved across donors 
(fig. S12B). 

Notably, ECs from the thymus, vasculature, 
prostate, and eye were similarly distributed 
across several clusters, which suggests 
similarity not only in their transcriptional 
profiles but also in their sources of 
heterogeneity. Differential 
gene expression analysis between ECs from 
these 16 tissues revealed several canonical and 
previously undescribed tissue-specific vascular 
markers (Fig. 3C). These data recapitulate 
tissue-specific vascular markers, such as LCN1 
(tear lipocalin) in the eye, ABCG2 (transporter 
at the blood-testis barrier) in the prostate, and 
OIT3 (oncoprotein-induced transcript 3) in 
the liver. Of the potential previously unde- 
scribed markers determined by this analysis, 
SLC14A1 (solute carrier family 14 member 1) 
appears to be a specific marker for endothe- 
lial cells in the heart, whose expression was 
independently validated with data from the 
Human Protein Atlas (18) (fig. S13). 

Lung ECs formed two distinct populations, 
which is in line with the aerocyte (aCap - 
EDNRB+) and general capillary (gCap - PLVAP+) 
cells described in the mouse and human lung 
(19) (fig. S12, C and D). The transcriptional 
profiles of gCaps were also more similar to 
ECs from other tissues, indicative of their 
general vascular functions in contrast to the 
more specialized aCap populations. Lastly, we 
detected two distinct populations of ECs in 
the muscle, including an MSX1+ population 
with strong angiogenic and endothelial cell 
proliferation signatures and a CYP1B1+ pop- 
ulation enriched in metabolic genes, which 
suggests the presence of functional special- 
ization in the muscle vasculature (fig. S12, E 
and F). 


Alternative splice variants are cell 
type specific 

We used SICILIAN (20) to identify alternative 
splice junctions in Tabula Sapiens using both 
10x and smart-seq2 sequencing technologies 
and found a total of 955,785 junctions (fig. S14, 
A to E, and table S7). Of these, 217,855 were 
previously annotated, so our data provide in- 
dependent validation of 61% of the total junc- 
tions cataloged in the entire RefSeq database. 
Although annotated junctions made up only 
22.8% of the total junctions, they represent 
93% of total reads, which indicates that 
previously annotated junctions tend to be 
expressed at higher levels than previously 
unidentified junctions. 

Fig. 2. Comparison of single-cell transcriptomics with conventional histology. Clinical pathology was 
performed on nine tissues from donors TSP2 and TSP14. (A) H&E-stained image used for histology of the colon 
from TSP2, with compartments (solid colored lines) and individual cell types (dashed black ellipses) 
identified by the pathologists. (B) Coarse cell type representation of TSP2 as morphologically estimated by 
pathologists across several tissues, ordered by increasing heterogeneity of the tissue. Compartment colors are 
consistent between (A) and (B). [Axis label: Relative compartment abundance (%).] 

We additionally found 34,624 junctions between 
previously annotated 3' and 5' splice sites 
(3.6%). We identified 119,276 junctions between 
a previously annotated site and a previously 
unannotated site in the gene (12.4%). This 
leaves 584,030 putative junctions for which both 
splice sites were previously unannotated, i.e., 
~61% of the total detected junctions. Most of 
these have at least one end in a known gene 
(94.7%), whereas the remainder represent 
potential previously undescribed splice variants 
from unannotated regions (5.3%). In the absence 
of independent validation, we conservatively 
characterized all of the unannotated splices as 
putative previously unknown junctions. We then 
used the GTEx database (21) to seek independent 
corroborating evidence of these putative 
junctions and found that reads corresponding to 
nearly one-third of these previously unknown 
junctions can be found within the GTEx data 
(table S7); this corresponds to >300,000 
previously undefined validated splice variants 
revealed by the Tabula Sapiens. 
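
The junction bookkeeping above can be checked with a few lines of arithmetic. The counts are those quoted in the text; the category names are shorthand introduced here for illustration only.

```python
TOTAL_JUNCTIONS = 955_785

# Counts quoted in the text; the dictionary keys are informal labels.
categories = {
    "previously annotated junction": 217_855,
    "both splice sites annotated, pairing unannotated": 34_624,
    "one splice site annotated": 119_276,
    "neither splice site annotated": 584_030,
}

# The four categories partition the full set of detected junctions.
assert sum(categories.values()) == TOTAL_JUNCTIONS

percentages = {
    name: 100.0 * count / TOTAL_JUNCTIONS for name, count in categories.items()
}
for name, pct in percentages.items():
    print(f"{name}: {categories[name]:,} junctions ({pct:.2f}%)")
```

The computed shares agree with the percentages quoted in the text to within about 0.1 percentage point.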

Hundreds of splice variants are used in a 
highly cell type-specific fashion; these can be 
explored in the cellxgene browser (14), which 
uses a statistic called SpliZ (22). We focus on 
two examples of cell type-specific splicing of 
two well-studied genes: MYL6 and CD47. Similar 
cell type-specific splice usage was also 
observed with TPM1, TPM2, and ATP5F1C, 
three other genes with well-characterized splice 
variants (fig. S15). 

Fig. 3. Analysis of immune and endothelial cell types shared across tissues. 
(A) Illustration of clonal distribution of T cells across multiple tissues. The majority of 
T cell clones are found in multiple tissues and represent a variety of T cell subtypes. 
nk cell, natural killer cell. (B) Prevalence of B cell isotypes across tissues, ordered by 
decreasing abundance of IgA. (C) Expression levels of tissue-specific endothelial 
markers, shown as violin plots, in the entire dataset. Many of the markers are highly 
tissue specific and typically were derived from multiple donors, as follows: bladder 
(3 donors), eye (2), fat (2), heart (1), liver (2), lung (3), mammary (1), muscle (4), 
pancreas (2), prostate (2), salivary gland (2), skin (2), thymus (2), tongue (2), uterus 
(1), and vasculature (2). A detailed donor-tissue breakdown is available in table S2. 

MYL6 is an essential light chain (ELC) for 
myosin and is highly expressed in all tissues 
and compartments. Yet, splicing of MYL6, 
particularly that involving the inclusion or 
exclusion of exon 6 (Fig. 4A), varies in a cell 
type- and compartment-specific manner (Fig. 4B). 
Although the isoform excluding exon 6 has 
previously been mainly described in phasic 
smooth muscle (23), we discovered that it can 
also be the predominant isoform in non-smooth 
muscle cell types. Our analysis establishes 
pervasive regulation of MYL6 splicing in many 
cell types, such as endothelial and immune cells. 
These previously unknown, compartment-specific 
expression patterns of the two MYL6 isoforms 
are reproduced in multiple individuals from the 
Tabula Sapiens dataset (Fig. 4, A and B). 

Fig. 4. Alternative splicing analysis. (A and B) The sixth exon in MYL6 is skipped 
at different proportions in different compartments. Cells in the immune and epithelial 
compartments tend to skip the exon, whereas cells in the endothelial and stromal 
compartments tend to include the exon. Boxes are grouped by compartment and 
colored by tissue. The fraction of junctional reads that include exon 6 was calculated 
for each cell with >10 reads mapping to the exon-skipping event. Horizontal box 
plots in (B) show the distribution of exon inclusion in each cell type. (C and 
D) Alternative splicing in CD47 involves one 5' splice site (exon 11; 108,047,292) and 
four 3' splice sites. Horizontal box plots in (D) show the distribution of weighted 
averages of alternative 3' splice sites in each cell type. Epithelial cells tend to use exons 
closer to the 5' splice site compared with immune and stromal cells. Boxes are 
grouped by compartment and colored by tissue. [Axis label: Average splice position.] 
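
The per-cell exon inclusion fraction described in the Fig. 4 caption can be sketched as below. This is a simplified illustration: it assumes the ">10 reads" threshold applies to the combined inclusion and skipping junction reads in a cell, which the caption does not state explicitly, and the function and parameter names are hypothetical.

```python
def exon_inclusion_fraction(inclusion_reads, skipping_reads, min_reads=10):
    """Fraction of junctional reads supporting exon inclusion in one cell.

    Returns None when the cell does not have more than `min_reads`
    junctional reads at the event (assumed interpretation of the threshold).
    """
    total = inclusion_reads + skipping_reads
    if total <= min_reads:
        return None
    return inclusion_reads / total


# Invented toy cells: (inclusion reads, skipping reads).
toy_cells = [(9, 3), (5, 5), (0, 12), (6, 5)]
fractions = [exon_inclusion_fraction(i, s) for i, s in toy_cells]
print(fractions)
```

Cells that return `None` would simply be left out of the box plots rather than being counted as zero inclusion.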


CD47 is a multispanning membrane pro- 
tein involved in many cellular processes, in- 
cluding angiogenesis and cell migration and 
as a “do not eat me” signal to macrophages 
(24). Differential use of exons 7 to 10 (Fig. 4C 
and fig. S14F) composes a variably long 
cytoplasmic tail (25). Immune cells, but also 
stromal and endothelial cells, have a distinct, 
consistent splicing pattern in CD47 that 
dominantly excludes two proximal exons and 
splices directly to exon 8. In contrast to other 
compartments, epithelial cells exhibit a differ- 
ent splicing pattern that increases the length 
of the cytoplasmic tail by splicing more com- 
monly to exon 9 and exon 10 (Fig. 4D). Char- 
acterization of the splicing programs of CD47 
in single cells may have implications for under- 
standing the differential signaling activities 
of CD47 and for therapeutic manipulation 
of CD47 function. 


Cell state dynamics can be inferred from a 
single time point 

Although the Tabula Sapiens was created from 
a single moment in time for each donor, it is 
possible to infer dynamic information from 
the data. Cell division is an important tran- 
sient change of internal cell state, and we 
computed a cycling index for each cell type to 
identify actively proliferating versus quiescent 
or postmitotic cell states. Rapidly dividing pro- 
genitor cells had among the highest cycling 
indices, whereas cell types from the endothe- 
lial and stromal compartments, which are 
known to be largely quiescent, had low cycling 
indices (Fig. 5A). In intestinal tissue, transient 
amplifying cells and the crypt stem cells divide 
rapidly in the intestinal crypts to give rise to 
terminally differentiated cell types of the villi 
(26). These cells were ranked with the highest 
cycling indices, whereas terminally differ- 
entiated cell types, such as the goblet cells, 
had the lowest ranks (fig. S16A). To com- 
plement the computational analysis of cell 
cycling, we performed immunostaining of 
intestinal tissue for the MKI67 protein 
(commonly referred to as Ki-67) and confirmed 
that transient amplifying cells abundantly 
express this proliferation marker (fig. S16, 
B and C), which supports the conclusion that 
this marker is differentially expressed in the 
G2/M cluster. 

We observed several interesting tissue-specific 
differences in cell cycling. To illustrate one ex- 
ample, UMAP clustering of macrophages 
showed tissue-specific clustering of this cell 
type and that blood, bone marrow, and lung 
macrophages have the highest cycling indices 
compared with macrophages found in the 
bladder, skin, and muscle (fig. S16, D to G). 
Consistent with this finding, the expression 
values of cyclin-dependent kinase (CDK) 
inhibitors (in particular the gene CDKN1A), 
which block the cell cycle, have the lowest 
overall expression in macrophages from tis- 
sues with high cycling indices (fig. S16F). 

We used RNA velocity (27) as a further dy- 
namic approach to study transdifferentiation 
of bladder mesenchymal cells to myofibroblasts 
(Fig. 5B). Latent time analysis, which provides 
an estimate of each cell’s internal clock using 
RNA velocity trajectories (28), correctly iden- 
tified the direction of differentiation (Fig. 5C) 
across multiple donors. Ordering cells as a 
function of latent time shows clustering of the 
mesenchymal and myofibroblast gene expres- 
sion programs for the most dynamically ex- 
pressed genes (Fig. 5D). Among these genes, 
ACTN1 (alpha actinin 1), a key actin 
cross-linking protein that stabilizes 
cytoskeleton-membrane interactions (29), 
increases across the mesenchymal-to-myofibroblast 
transdifferentiation trajectory (fig. S16H). 
Another gene 
with a similar trajectory is MYLK (myosin light- 
chain kinase), which also rises as myofibro- 
blasts attain more muscle-like properties (30). 
Finally, a random sampling of the most dy- 
namic genes shared across TSP1 and TSP2 
demonstrated that they share concordant 
trajectories and revealed some of the core 
genes in the transcriptional program underly- 
ing this transdifferentiation event within the 
bladder (fig. S16I). 


Unexpected spatial variation in 
the microbiome 


The Tabula Sapiens provided an opportunity 
to densely and directly sample the human mi- 
crobiome throughout the gastrointestinal tract. 
The intestines from donors TSP2 and TSP14 
were sectioned into five regions: the duode- 
num, jejunum, ileum, and ascending and 
sigmoid colon (Fig. 6A). Each section was 
transected, and three to nine samples were 
collected from each location, followed by am- 
plification and sequencing of the 16S riboso- 
mal RNA (rRNA) gene. Uniformly, there was a 
high (~10 to 30%) relative abundance of Pro- 
teobacteria, particularly Enterobacteriaceae 
(Fig. 6B), even in the colon. Samples from each 
of the duodenum, jejunum, and ileum were 
largely distinct from one another, with sam- 
ples exhibiting individual patterns of bloom- 
ing or absence of certain families (Fig. 6B). 
These data reveal that the microbiota are 
patchy, even at a 3-inch (7.62-cm) length scale. 
We observed similar heterogeneity in both 
donors (fig. S17, A to C). In the small intestine, 
richness (number of observed species) was also 
variable and was negatively correlated with the 
relative abundance of Burkholderiaceae (Fig. 
6B); in TSP2, the Proteobacteria phylum was 
dominated by Enterobacteriaceae, which was 
present at >30% in all samples at a level 
negatively correlated with richness (fig. S17, A to C). 
In a comparison of species from adjacent 
regions across the gut, a large fraction of 
species was specific to each region (Fig. 6C), 
reflecting the patchiness. These data are de- 
rived from only two donor samples, and 
further conclusions about the statistics and 
extent of microbial patchiness will require 
larger studies. 

We analyzed host immune cells in conjunction 
with the spatial microbiome data; UMAP 
clustering analysis revealed that the small 
intestine T cell pool from TSP14 contained 
a population with distinct transcriptomes 
(Fig. 6D). The most significant transcriptional 
differences in T cells between the small and 
large intestine were genes associated with 
trafficking, survival, and activation (Fig. 6E 
and table S8). For example, expression of the 
long noncoding RNA MALAT1, which affects 
the regulatory function of T cells, and of CCR9, 
which mediates T lymphocyte development 
and migration to the intestine (31), was high 
only in the small intestine, whereas GPR15 
(colonic T cell trafficking), SELENBP1 (selenium 
transporter), ANXA1 (repressor of inflammation 
in T cells), KLRC2 (T cell lectin), CD24 
(T cell survival), GDF15 (T cell inhibitor), and 
RARRES2 (T cell chemokine) exhibited much 
higher expression in the large intestine. Within 
the epithelial cells, we observed distinct tran- 
scriptomes between small and large intestine 
Paneth cells and between small and large in- 
testine enterocytes, whereas there was some 
degree of overlap for each of the two cell types 
for either location (fig. S17, E and F). The site- 
specific composition of the microbiome in the 
intestine, paired with distinct T cell populations 
at each site, helps define local host-microbe 
interactions that occur in the gastrointestinal 
tract and is likely reflective of a gradient of 
physiological conditions that influence host- 
microbe dynamics. 
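
The richness and Shannon diversity reported for the 16S samples in this section can be computed from per-taxon read counts as in the following minimal sketch. It assumes natural logarithms and raw counts per sample; the function names and the toy counts are illustrative, and the authors' exact processing of amplicon sequence variants is not reproduced here.

```python
import math


def richness(counts):
    """Number of taxa observed at least once in a sample."""
    return sum(1 for c in counts if c > 0)


def shannon_diversity(counts):
    """Shannon diversity H = -sum(p_i * ln p_i) over observed taxa."""
    total = sum(counts)
    if total <= 0:
        raise ValueError("sample has no reads")
    h = 0.0
    for c in counts:
        if c > 0:
            p = c / total
            h -= p * math.log(p)
    return h


# Invented toy samples: read counts per taxon.
even_sample = [5, 5, 5, 5]
bloom_sample = [97, 1, 1, 1]

print(richness(even_sample), shannon_diversity(even_sample))
print(richness(bloom_sample), shannon_diversity(bloom_sample))
```

Both toy samples have the same richness, but the sample dominated by a single taxon has a much lower Shannon diversity, which is the kind of bloom-driven unevenness the text describes.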


Conclusion 


The Tabula Sapiens is part of a growing set 
of data that, when analyzed together, will en- 
able many interesting comparisons of both 
a biological and technical nature. Studying 
particular cell types across organs, datasets, 
and species will yield new biological insights— 
as shown with fibroblasts (32). Similarly, com- 
paring fetal human cell types (33) with those 
determined in this work in adults may give 
insight into the loss of plasticity from early 
development to maturity. Having multior- 
gan data from individual donors may facili- 
tate the development of methods to compare 
diverse datasets and yield understanding of 
technical artifacts from various approaches 
(8, 9, 34, 35). The Tabula Sapiens has en- 
abled discoveries relating to shared behav- 
ior and organ-specific differences across cell 
types. For example, we found T cell clones 
shared between organs and characterized 
organ-dependent hypermutation rates among 
resident B cells. Endothelial cells and macro- 
phages are cell types that are shared across 
tissues but often show subtle tissue-specific 
differences in gene expression. We found 
an unexpectedly large and diverse amount 
of cell type-specific RNA splice variant usage 
and discovered and validated many previously 
undiscovered splices. These are but a few 
examples of how the Tabula Sapiens represents 
a broadly useful reference to deeply 
understand and explore human biology at 
cellular resolution. 

Fig. 5. Dynamic changes in cell state. (A) Cell types ordered by magnitude 
of cell cycling index per donor (each a separate color), with the most highly 
proliferative at the top and quiescent cells at the bottom of the list. (B) RNA velocity 
analysis demonstrating mesenchymal-to-myofibroblast transition in the bladder. 
The arrows represent a flow derived from the ratio of unspliced to spliced 
transcripts, which in turn predicts dynamic changes in cell identity. (C and D) Latent 
time analysis of the mesenchymal-to-myofibroblast transition in the bladder, 
demonstrating stereotyped changes in gene expression trajectory. 
[Figure labels: Dynamical genes; Latent time. Legend: Mesenchymal cell 1, Mesenchymal cell 2, 
Mesenchymal cell 3, Myofibroblast.] 

Fig. 6. High-resolution view highlights patchiness of the gut microbiome. 
(A) Schematic (left) and photo (right) of the colon from donor TSP2, 
with numbers 1 to 5 representing microbiota sampling locations. (B) Relative 
abundances and richness (number of observed species) at the family level 
in each sampling location, as determined by 16S rRNA sequencing. The 
Shannon diversity, a metric of evenness, mimics richness. Variability in 
relative abundance and/or richness or Shannon diversity was higher in the 
duodenum, jejunum, and ileum compared with the ascending and sigmoid 
colon. (C) A Sankey diagram showing the inflow and outflow of microbial 
species from each section of the gastrointestinal tract. The stacked bar for 
each gastrointestinal section represents the number of observed species 
in each family as the union of all sampling locations for that section. The 
stacked bar flowing out represents gastrointestinal species not found in the 
subsequent section, and the stacked bar flowing into each gastrointestinal 
section represents the species not found in the previous section. ASVs, 
amplicon sequence variants. (D) UMAP clustering of T cells reveals distinct 
transcriptome profiles in the distal and proximal small and large intestines. 
(E) Dots in volcano plot highlight genes up-regulated in the large (left) 
and small (right) intestines. Labeled dots include genes with known roles 
in trafficking, survival, and activation. [Axis label: log2(fold-change).] 

Materials and methods summary 

Fresh, whole, and nontransplantable organs, 
or 1- to 2-cm² organ samples, were obtained 
from surgery and then transported on ice by 
courier to tissue expert laboratories, where 
they were immediately prepared for 
transcriptome sequencing. Single-cell 
suspensions were prepared for 10x Genomics 
3' V3.1 droplet-based sequencing and for 
FACS-sorted 384-well plate smart-seq2. 
Preparation began with dissection, digestion 
with enzymes, and physical manipulation; 
tissue-specific details are available in the 
complete materials and methods (12). Cell 
suspensions from some organs were normalized 
by major cell compartment (epithelial, 
endothelial, immune, and stromal) using 
antibody-labeled magnetic microbeads to enrich 
rare cell types. cDNA and sequencing libraries 
were prepared and run on the Illumina NovaSeq 
6000 with the goal to obtain 10,000 
droplet-based cells and 1000 plate-based cells 
for each organ. Sequences were demultiplexed 
and aligned to the GRCh38 reference genome. 
Gene count tables were generated with 
CellRanger (droplet samples) or STAR and HTSEQ 
(plate samples). Cells with low unique 
molecular identifier (UMI) counts or low gene 
counts were removed. Droplet cells were 
filtered to remove barcode-hopping events and 
filtered for ambient RNA using DecontX. 
Sequencing batches were harmonized using scVI 
and projected to two-dimensional (2D) space 
with UMAP for analysis by the tissue experts. 
Expert annotation was made through the 
cellxgene browser and regularized with a public 
cell ontology. Annotation was manually QC 
checked and cross-validated with PopV, an 
annotation tool that uses seven different 
automated annotation methods. For complete 
materials and methods, see the supplementary 
materials (12). 
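
The removal of cells with low UMI or low gene counts can be illustrated with a minimal sketch. The thresholds below are placeholders chosen for illustration only, since this summary does not state the cutoffs used; the `Cell` record and function names are likewise hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Cell:
    barcode: str
    umi_count: int   # total UMIs detected in the cell
    gene_count: int  # number of genes with at least one UMI


def filter_cells(cells, min_umis, min_genes):
    """Keep cells that meet both the UMI and the gene-count threshold."""
    return [
        c for c in cells
        if c.umi_count >= min_umis and c.gene_count >= min_genes
    ]


# Invented toy cells and placeholder thresholds (not the study's values).
cells = [
    Cell("AAAC", umi_count=5200, gene_count=1800),
    Cell("AAAG", umi_count=900, gene_count=450),    # fails the UMI threshold
    Cell("AACT", umi_count=3100, gene_count=150),   # fails the gene threshold
]
kept = filter_cells(cells, min_umis=2500, min_genes=200)
print([c.barcode for c in kept])
```

Later steps named in the summary, such as ambient RNA correction and batch harmonization, operate on the cells that survive this kind of filter and are not sketched here.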
HUMAN CELL ATLAS 


Single-nucleus cross-tissue molecular reference 
maps toward understanding disease gene function 


Gökcen Eraslan†, Eugene Drokhlyansky†, Shankara Anand†, Evgenij Fiskin†, Ayshwarya Subramanian†, 
Michal Slyper†, Jiali Wang†, Nicholas Van Wittenberghe, John M. Rouhana, Julia Waldman, Orr Ashenberg, 
Monkol Lek, Danielle Dionne, Thet Su Win, Michael S. Cuoco, Olena Kuksenko, Alexander M. Tsankov, 
Philip A. Branton, Jamie L. Marshall, Anna Greka, Gad Getz, Ayellet V. Segré*§, François Aguet*§, 
Orit Rozenblatt-Rosen*§, Kristin G. Ardlie*§, Aviv Regev*§ 


INTRODUCTION: Understanding and treating 
disease requires deep, systematic character- 
ization of different cells and their interactions 
across human tissues and organs, along with 
characterization of the genetic variants that 
causally contribute to disease risk. Recent 
studies have combined single-cell atlases of 
specific human tissues and organs with genes 
associated with human disease to relate risk 
variants to likely cells of action. However, it 
has been challenging to extend these studies 
to profile multiple tissues and organs across 
the body, conduct studies at population scale, 
and integrate cell atlases from multiple organs 
to yield unified insights. 


RATIONALE: Because of the pleiotropy and 
specificity of disease-associated variants, 
systematically relating variants to cells and 
molecular processes requires analysis across 
multiple tissues and individuals. Prior cell 
atlases primarily relied on fresh tissue samples 
from a single organ or tissue. Single-nucleus 
RNA sequencing (snRNA-seq) can be applied to 
frozen, archived tissue and captures cell types 
that do not survive dissociation across many 
tissues. Deep learning methods can integrate 
data across individuals and tissues by 
controlling for batch effects while preserving 
biological variation. 


RESULTS: We established a framework for 
multi-tissue human cell atlases and generated an 
atlas of 209,126 snRNA-seq profiles from eight 
tissue types across 16 individuals, archived as 
frozen tissue as part of the Genotype-Tissue 
Expression (GTEx) project. We benchmarked 
four protocols and show how to apply them in 
a pooled setting to enable larger studies. We 
integrated the cross-tissue atlas using a 
conditional variational autoencoder, annotated 
it with 43 broad and 74 fine categories, and 
demonstrated its use to decipher tissue 
residency, such as a macrophage dichotomy and 
lipid associations that are preserved across 
tissues, and tissue-specific fibroblast 
features, including lung alveolar fibroblasts 
with likely roles in mechanosensation. We relate 
cells to human disease biology and disease-risk 
genes for both rare and common diseases, 
including rare muscle disease gene groups 
enriched in distinct subsets of myonuclei and 
nonmyocytes, and cell type-specific enrichment 
of expression and splicing quantitative trait 
locus (QTL) target genes mapped to genome-wide 
association study loci. 


Cross-tissue snRNA-seq atlas in eight frozen, archived adult human tissues. Tissue sites and 
experimental pipeline (top row). The resulting atlas enables a cross-tissue census of tissue-specific and 
shared cell types (middle left). Differentiation trajectories and 
Robust and scalable computational methods enable comprehensive associations of monogenic and 
complex diseases to tissue-specific and shared cell populations (bottom center and right). 
[Panel titles: Frozen archived tissue collection; Shared and tissue-specific features of fibroblasts; 
Dichotomous macrophage populations; Monogenic disease gene enrichment.] 


CONCLUSION: Our framework will empower 
large, cross-tissue population and/or disease 
studies at single-cell resolution. These frame- 
works and the cross-tissue perspective provided 
here will form a basis for larger-scale future 
studies to improve our understanding of cross- 
tissue and cross-individual variation of cellular 
phenotypes in relation to disease-associated 
genetic variation. 

Understanding gene function and regulation in homeostasis and disease requires knowledge of the cellular 
and tissue contexts in which genes are expressed. Here, we applied four single-nucleus RNA sequencing 
methods to eight diverse, archived, frozen tissue types from 16 donors and 25 samples, generating a cross- 
tissue atlas of 209,126 nuclei profiles, which we integrated across tissues, donors, and laboratory methods with 
a conditional variational autoencoder. Using the resulting cross-tissue atlas, we highlight shared and tissue- 
specific features of tissue-resident cell populations; identify cell types that might contribute to neuromuscular, 
metabolic, and immune components of monogenic diseases and the biological processes involved in their 
pathology; and determine cell types and gene modules that might underlie disease mechanisms for complex 
traits analyzed by genome-wide association studies. 


Tissue homeostasis and pathology arise
from an intricate interplay between dif-
ferent cell types, such that disease risk
is influenced by variation in genes that
affect the cells’ functions and interactions.
Human genetics studies, to date, have mapped 
tens of thousands of loci that either underlie 
rare monogenic disease or are associated with 
complex polygenic disease risk (1-3), including
many in regulatory regions, whereas single- 
cell genomics has become instrumental in 
constructing cell atlases of both healthy organs 
and diseased tissues (4-6). 
Coupling these advances in human genetics
and single-cell genomics should enhance our 
understanding of cell type-specific changes in 
the function and regulation of disease genes. 
In particular, tissue (7), cell type (8-11), time 
point, and stimulation (12-14) all affect gene
expression in disease-associated genetic loci. 
Recently, studies combining single-cell expres- 
sion atlases and genetic signals have been able 
to associate risk genes with specific cell types 
and states in relevant tissues (15-18). 

Because complex diseases often manifest in 
and implicate cells across multiple tissues, 
fully understanding the way in which genetic 
variation affects disease requires generating 
atlases from diverse tissues across the body 
and from many individuals, spanning differ- 
ent populations. This poses several challenges. 
First, collecting fresh tissue samples at scale is 
logistically challenging, and some tissues, such 
as brain, muscle, and adipose, are difficult to 
process into single-cell suspensions (19-23). As 
a result, large-scale single-cell profiling studies 
in human populations (24, 25) have focused on 
peripheral blood mononuclear cells, which can 
be frozen and thawed for multiplexed single- 
cell analysis. Single-nucleus RNA sequencing 
(snRNA-seq) offers a compelling alternative 
because it can be applied to archived, frozen 
tissues (26, 27) from multiple organs and cap- 
tures diverse cell types. Second, annotation 
and classification of cell types and states re- 
quire defining biological relationships be- 
tween parenchymal, immune, and stromal cells 
across tissue types. Finally, data integration 
and interpretation require cross-tissue ana- 
lytical frameworks to remove unwanted vari-
ation while preserving biological differences;
identify cell types and states; and relate cell 
types and states to monogenic and complex 
trait genetics. 


Results 

A multitissue, multi-individual single-nucleus 
reference atlas from archived, frozen 
human tissues 


We constructed a cross-tissue snRNA-seq atlas 
from 25 archived, frozen tissue samples, pre- 
viously collected and banked by the Genotype- 
Tissue Expression (GTEx) project (7), that span 
three or four samples from each of eight tissue 
sites—breast, esophagus mucosa, esophagus 
muscularis, heart, lung, prostate, skeletal mus- 
cle, and skin—from 16 individuals (seven males 
and nine females) (Fig. 1A). We selected the 
samples by RNA quality, tissue autolysis score, 
and the availability of existing bulk RNA-seq 
and genome sequencing data [(28); table S1]. 
Histology slides corresponding to each tissue 
were reviewed by a pathologist to provide de- 
tailed annotations (table S1). Because different 
nucleus extraction protocols can be optimal 
for different tissues (26, 29), we isolated nuclei 
from each sample using four protocols that 
vary in detergents, salt, buffer, and mechanical 
preparation methodology [CST (0.49% CHAPS 
detergent, salts, and Tris buffer), NST (NP-40, 
salts, and Tris buffer), TST (0.03% Tween 20 
detergent, salts, and Tris buffer), and the EZ 
nuclei isolation kit (proprietary composition; 
NUC101, Sigma-Aldrich); (26, 28, 29); table S1], 
followed by droplet-based single-cell RNA-seq 
(scRNA-seq) (28). 

We processed the initial snRNA-seq profiles 
to retain high-quality nuclei profiles and re- 
move the effects of contaminant transcripts 
from ambient RNA (28). In breast and skin, the 
majority of nuclei profiles were recovered from 
only one individual sample for each tissue 
(breast: 61.3%, skin: 93.1%; table S1). Some 
tissues and protocols had higher ambient RNA 
contamination, reflected as spurious expres- 
sion of highly expressed transcripts from one 
cell type in nuclei profiles of other cell types. 
Such effects were more prominent in skeletal 
muscle and heart [false discovery rate (FDR) < 
0.05], irrespective of protocol, but were also 
present in other tissues [(28); fig. S1]. We cor- 
rected for ambient RNA contamination with 
CellBender v2.1 (30) (fig. S1) and further ap-
plied standard quality control metrics (28),
retaining 209,126 nuclei profiles across the 
eight tissues, with a mean of 918 genes and 
1519 transcripts (unique molecular identifiers) 
detected per profile. 


Cross-tissue atlas annotation recovers diverse 
cell types, including difficult-to-profile and rare 
cell subsets 


We integrated data from all samples and meth- 
ods using a conditional variational autoencoder 
Fig. 1. Cross-tissue snRNA-seq atlas in eight archived, frozen adult human tissues. (A) Study design. (B to F) Cross-tissue single-nucleus atlas. Uniform manifold
approximation and projection (UMAP) representation of single-nucleus profiles (dots) colored by main compartments (B), broad cell types (C), tissues (D), isolation protocol 
(E), and individual donors (F). (G) Cell-type composition across tissues. The overall proportion of cells (%) of each type and number of nuclei profiled in each tissue (rows) 
are shown. Numbers in circles indicate the corresponding broad cell type. Black vertical lines indicate the relative proportion of nuclei from each individual. 


(cVAE), which is designed to correct for multi- 
ple sources of variation in expression, such as 
individual-, sex-, and protocol-specific effects, 
while preserving tissue- and cell type-specific 
variation [(28); Fig. 1, B to F, and figs. S2A and 
S3]. We benchmarked the cVAE against sev-
eral other data integration methods, obtaining
comparable or improved results and providing 
guidelines for future integration efforts [fig. S4 
and supplementary text note S1; (28)]. Cells 
grouped first by cell type and then by tissue-
specific subclusters (Fig. 1, B to D), suggest-
ing that the variation between cell types is 
larger than the variation within a cell type 
across tissues. 
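
To make the idea of conditioning concrete, the following minimal Python sketch shows one common way a conditional autoencoder can be given nuisance covariates: each profile's expression vector is concatenated with one-hot encodings of its donor, sex, and protocol. This is only an illustration of the input layout under stated assumptions, not the model described in (28); the function names, covariate names, and category lists are hypothetical.

```python
# Sketch: building a covariate-conditioned input vector for a conditional VAE.
# All names and category lists here are illustrative assumptions.

def one_hot(value, categories):
    """Return a one-hot list marking the position of value in categories."""
    if value not in categories:
        raise ValueError(f"unknown category: {value!r}")
    return [1.0 if c == value else 0.0 for c in categories]


def conditioned_input(expression, covariates, vocab):
    """Concatenate an expression vector with one-hot encoded covariates.

    expression : list of floats (for example, normalized counts per gene)
    covariates : dict such as {"donor": "d2", "sex": "F", "protocol": "TST"}
    vocab      : dict mapping each covariate name to its ordered categories
    """
    vector = list(expression)
    for name in sorted(vocab):  # fixed ordering keeps the layout stable
        vector.extend(one_hot(covariates[name], vocab[name]))
    return vector


vocab = {
    "donor": ["d1", "d2", "d3"],
    "protocol": ["CST", "EZ", "NST", "TST"],
    "sex": ["F", "M"],
}
x = conditioned_input(
    [0.0, 2.5, 1.0],
    {"donor": "d2", "sex": "F", "protocol": "TST"},
    vocab,
)
print(len(x), x)
```

Because the decoder of such a model would also receive the covariate block, variation explained by donor, sex, or protocol need not be stored in the latent space, which is the intuition behind preserving tissue- and cell type-specific variation.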
We annotated cell types within each tissue
after dimensionality reduction and graph-based 
clustering (28) by identifying genes that are 
differentially expressed between clusters and 
comparing them with literature-based marker 
genes [(28); tables S2 and S3]. We curated 
comprehensive lists of cell-type markers from 
the literature for each tissue (figs. S5 and S6 
and table S3), including markers for relatively 
poorly characterized cells, such as interstitial 
cells of Cajal (ICCs). We defined cellular com- 
partments shared across tissues (e.g., adipose, 
endothelial, epithelial, fibroblast, immune, 
muscle) (Fig. 1B), broad cell types (e.g., luminal 
epithelial cells, vascular endothelial cells) (Fig. 
1C and fig. S5), and granular cell subsets (e.g., 
luminal epithelial cell 1 and 2) (fig. S6). The 
annotations were consistent across extraction 
protocols, tissues, and donors (figs. S2, B and
D, and S7).

The atlas features 43 broad cell classes (Fig. 
1C and tables S2 and S3), with both tissue-
shared cell types and tissue-specific subsets
(e.g., Fig. 1G and figs. S2, B and D, and S5). 
For example, tissue-specific cell types such 
as pneumocytes (alveolar type I and II) and 
keratinocytes were the predominant cell types 
in the lung and skin, respectively. Many shared 
broad cell types such as immune and stromal 
cells were detected across all tissues (fig. S2, 
D and E), but with tissue-specific specializa- 
tions (discussed later in the text). For example, 
macrophages made up the largest immune 
population, with diverse subsets of tissue- 
resident cells. 

The atlas captured profiles from cell classes 
that are difficult to profile by dissociation-based 
scRNA-seq (23, 31, 32), including 2350 adipo- 
cytes, 21,607 skeletal muscle myonuclei, and 
9619 cardiac myonuclei. We detected adipo- 
cytes in five of the eight tissue types (breast, 
muscle, heart, esophagus muscularis, and skin), 
with 86% of adipocytes from breast tissue (fig.
S2D), making up 18% of all breast nuclei pro-
files (Fig. 1G) (28). Skeletal and cardiac myo-
nuclei included key subsets (33, 34). Cardiac 
myonuclei primarily included the previously 
distinguished classical myonuclei as well as 
the recently reported “cytoplasmic myonu-
clei” (33) (fig. S8 and supplementary text note
S2). Other myonuclei subsets included neuro-
muscular junction (NMJ)-localized skeletal 
muscle myonuclei, which have also been ob- 
served in scRNA-seq and snRNA-seq studies 
in mice (21, 31, 35), and “fast-twitch” and “slow-
twitch” subtypes (Fig. 1, B and C, and fig. S6G),
which are characterized by differentially ex- 
pressed genes (fig. S9, A and B) that are con- 
cordant with previously reported markers (36) 
(fig. S9, C and D). 

Cross-tissue and cross-sample integration 
enhanced our ability to resolve multiple rare 
cell subsets (Fig. 1C and figs. S2, B and D, S5, 
and S6). For example, we detected Schwann 
cells that support peripheral nerves (37) in
multiple tissues (esophagus mucosa and mus- 
cularis, heart, prostate, and skeletal muscle), 
rare neuroendocrine cells in the prostate (38), 
and rare (26) ICCs and enteric neurons in the 
esophagus. Because these rare cells can contrib- 
ute to various pathologies, their profiling in hu- 
man tissues will enable disease studies (26, 37). 


snRNA-seq protocols perform well across 
tissues and correspond to scRNA-seq 


We benchmarked the performance of our nu- 
cleus extraction and profiling protocols (26, 29) 
relative to each other across all eight profiled 
tissues and to other snRNA-seq, scRNA-seq, 
and bulk RNA-seq datasets in relevant tis- 
sues. For each dataset, we compared standard 
quality control metrics per profiled cell or 
nucleus, as well as the diversity and propor- 
tions of captured cell types. 

Of the four tested nucleus isolation pro- 
tocols (CST, NST, TST, and EZ; table S1), the 
EZ protocol displayed lower performance in 
each of the eight profiled tissues by multiple 
quality metrics (28, 39) (Fig. 2A and fig. S10). 
These included the lowest total number of 
nuclei captured (fig. S10, A and B), higher levels 
of ambient RNA (FDR < 0.05; fig. S1, A and B), 
and separate grouping of EZ-profiled samples 
[fig. S2C; (28)]. 

The extraction protocols also varied in the 
proportion of nuclei recovered from each cell 
type (figs. S2, B and D, and S11A; supplemen- 
tary text note S3), consistent with our pre- 
vious observations in tumors (29). The TST, 
CST, and NST protocols had comparable cell- 
type diversity as measured by Shannon en- 
tropy [Fig. 2A; (28)], whereas the EZ protocol 
resulted in significantly lower diversity (Fig. 
2A; linear mixed-effects model effect size of 
-1.08, P = 5 x 10”). Overall, TST yielded the 
highest cell-type diversity, on average, across 
tissues (Fig. 2A) and significantly higher pro- 
portions of T cells, fibroblasts, and vascular 
endothelial cells (FDR < 10%; fig. S11A). Be-
cause the protocols varied by their perform-
ance (most diverse, high capture of the desired
cell types), users should choose protocols by 
matching protocol features to scientific goal, 
tissue type, and complexity; and further pro- 
tocol optimization may still be required (29) 
(for further details and guidance, see supple- 
mentary text note S3). 
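
As a minimal sketch of the diversity measure used above, the Shannon entropy of a sample can be computed from the cell-type labels assigned to its nuclei. The natural logarithm is an assumption here (the base is not stated in this section), and the function name and example labels are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(labels):
    """Shannon entropy (natural log) of the cell-type label distribution."""
    counts = Counter(labels)
    total = sum(counts.values())
    # Sum of -p * log(p) over every observed cell type.
    return -sum((n / total) * math.log(n / total) for n in counts.values())


# A sample dominated by one cell type is less diverse than a balanced one.
balanced = ["fibroblast", "T cell", "endothelial", "myocyte"] * 25
skewed = ["myocyte"] * 97 + ["fibroblast", "T cell", "endothelial"]
print(shannon_entropy(balanced), shannon_entropy(skewed))
```

Entropy rises both with the number of cell types recovered and with how evenly nuclei are spread among them, so a protocol that captures mostly one abundant cell type scores low even if rare types are technically present.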

We compared cell-type compositions be- 
tween our four protocols and other snRNA-seq 
studies, focusing on our frozen heart left- 
ventricle samples, for which two recently pub- 
lished snRNA-seq studies evaluated similar 
samples (33, 34). We found agreement in 
broad cell types such as mast cells, adipocytes, 
“cytoplasmic” cardiac myonuclei, and Schwann 
cells (fig. S11, B and C, and table S4)—but 
differences in some of their proportions (fig. 
S11, D and E). Protocols used in the published 
studies and the EZ protocol in our study cap-
tured a higher proportion of muscle nuclei,
whereas CST, NST, and TST yielded a higher 
proportion of endothelial cells (fig. S11E). There
was also high concordance between the expres- 
sion profiles of bulk RNA-seq [from GTEx; (7)] 
and pseudobulk profiles derived from our 
snRNA-seq (accuracy 92.3%; fig. S12). A few 
samples showed lower agreement (heart-EZ, 
breast-EZ, and two breast-TST samples), sug- 
gesting that these particular tissue-protocol 
combinations may not reflect cellular compo- 
sition as accurately. 

We next compared snRNA-seq data to fresh- 
tissue scRNA-seq data from lung (40), skin 
[current study; (28)], and prostate (38). For 
cell composition (Fig. 2B), we recovered the 
same main cell groups across compartments. 
We confirmed the accuracy of our annotations 
by training a multiclass random forest classi- 
fier on snRNA-seq data and predicting cell 
types on scRNA-seq data [(28); Fig. 2, C to E], 
and vice versa (fig. S11, F to H). In addition, cell- 
type intrinsic (pseudobulk) profiles of protein- 
coding genes were overall similar between 
snRNA-seq and scRNA-seq [average Spearman 
ρ across cell types of 0.58 (skin), 0.69 (prostate),
and 0.47 (lung); table S4]. Moreover, integrat- 
ing cell and nuclei profiles from prostate, skin, 
and lung and annotating the cells with a ran- 
dom forest classifier that is trained on our 
nuclei profiles with our granular annotations 
yielded well-mixed groupings, similar marker 
genes, and high concordance between proto- 
cols (fig. S13). 
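
The comparison of cell-type intrinsic profiles can be illustrated with a short Python sketch: profiles of one cell type are averaged into a pseudobulk vector for nuclei and for cells, and the two vectors are compared by Spearman rank correlation. The averaging scheme, function names, and toy numbers are assumptions made for illustration; the normalization actually used is described in (28).

```python
import math


def pseudobulk(profiles):
    """Average per-gene expression over equally long single-profile vectors."""
    n = len(profiles)
    return [sum(column) / n for column in zip(*profiles)]


def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equally long numeric lists."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = math.sqrt(sum((a - mx) ** 2 for a in x))
    sy = math.sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy data: two nuclei and two cells of one cell type, four genes each.
nuclei = [[1, 0, 4, 2], [3, 0, 6, 2]]
cells = [[5, 1, 9, 4], [7, 1, 11, 2]]
rho = spearman(pseudobulk(nuclei), pseudobulk(cells))
print(rho)
```

A rank-based measure is a natural choice here because nuclei and cells differ systematically in absolute transcript levels, whereas the ordering of genes within a cell type is expected to be largely shared.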

Divergences observed include the greater ex- 
pression in cells versus nuclei of a dissociation- 
induced stress signature (41, 42) (Wilcoxon rank 
sum test, Benjamini-Hochberg FDR < 107"; 
Fig. 2F and fig. S14, A and B), as reported (29), 
and of ribosomal and nuclear-encoded mito- 
chondrial protein genes [Fig. 2G; (28); linear 
model], consistent with their longer half-lives 
and higher cytoplasmic levels (43, 44). Con- 
versely, nuclei profiles had higher levels of 
longer transcripts (fig. S14, H and I) and of 
transcripts with a larger number of adenine 
stretches (Fig. 2G and fig. S14, C to G), con- 
sistent with previous reports (45). 

Notably, our snRNA-seq generally captured 
relatively lower proportions of lymphocytes. 
For example, for lung and skin, respectively, 
T cells represented 1.7 and 1.4% of all cells 
(aggregated) compared with 8.73 and 6.83% 
by scRNA-seq. We observed similar patterns 
for B cells in skin. Furthermore, these immune 
cell proportions varied across samples and 
protocols. A study comparing snRNA-seq 
and in situ measurements (46) suggested that 
scRNA-seq may oversample immune cells. 


Myeloid populations across tissues 


Our cross-tissue atlas allowed us to charac- 
terize tissue-specific and shared features of 
Fig. 2. Concordance of cell-type diversity and cell-intrinsic profiles
between snRNA-seq and scRNA-seq. (A) Cell-type diversity (Shannon entropy, 
y axis) of each protocol (color) in each sample (dot) and tissue (x axis). Dashed 
lines indicate the average across samples. (B) Differences in cell proportions. 
The proportions (y axis) of cells from major categories (color) in each individual 
by tissue and protocol (x axis) are shown. (C to E) Concordance of cell-intrinsic
programs. Proportions of cells (dot color and size) of a manually annotated group 
(rows) predicted to belong to a given nucleus profile annotation label (columns) by a 
random forest classifier trained on nuclei and applied to cells of the same tissue for 
skin (C), lung (D) or prostate (E) are shown. (F) Tissue dissociation expression 
signatures in scRNA-seq. Scores [y axis, average background corrected log(TP10K+1)]
of a dissociation-related stress signature (41) in scRNA-seq (pink) and snRNA-seq 
(blue) profiles in each major lung cell type (x axis) are shown (***Benjamini-Hochberg 
FDR < 10°, Wilcoxon rank sum test). Box plots show median, quartiles, and 
whiskers at 1.5 times the interquartile range (IQR). (G) Divergent genes between 
cell and nucleus profiles. Averaged pseudobulk expression (28) of protein 
coding genes (dots) in skin basal keratinocyte nuclei (x axis) and cells (y axis) is 
shown. Divergent genes are represented by a black dot outline. The color scale 
shows the total length of polyA stretches with at least 20 adenine bases in logs 
scale. Epi., epithelial; sm., smooth; SMC, smooth muscle cell. 
tissue-resident immune cells, which play key
roles in immune surveillance and tissue sup- 
port (47, 48). Integration and annotation of 
14,156 myeloid nuclei profiles (28) (60% of im- 
mune nuclei) revealed 14 distinct monocyte, 
macrophage, and dendritic cell (DC) subsets 
(Fig. 3A; fig. S15, A and B; and table S5). 
These included CD16+ monocytes, CD14+ mono-
cytes, two transitional Mo/MΦ FCGR3A^low and
Mo/MΦ FCGR3A^high populations with coex-
pression of both monocyte and macrophage
markers (see next section), DC1s, DC2s (49), 
mature DCs, and Langerhans cells (Fig. 3B). 
Tissue macrophage states further included lung 
macrophages expressing an alveolar macrophage 
signature (50) (fig. S15C), proliferating macro- 
phages, cytokine- and chemokine-expressing 
inflammatory macrophages, and two addi-
tional macrophage subsets: MΦ LYVE1^high and
MΦ HLAII^high (Fig. 3B), where HLAII is HLA
class II. Finally, lipid-associated macrophage 
(LAM)-like nuclei highly expressed LAM signa-
ture (Fig. 3B and fig. S15C) (51) as well as lipid
metabolism-related, myeloid cell immune ac- 
tivation, and macrophage migration genes 
(fig. S15D). Although most myeloid subsets
were present in multiple tissues, notable ex- 
ceptions included PPARG?& lung macrophages, 
which were present only in lung, and CD207/
Langerin+ Langerhans cells, which were pre-
sent only (97%) in skin and esophagus mucosa,
consistent with their role in antigen sampling 
within stratified epithelia (52, 53) (Fig. 3C and 
fig. S15, F and I). 

Myeloid cell proportions were more highly 
correlated between samples within a tissue 
type (fig. S15, E and F) than between different 
tissues (fig. S15, E to H), confirming the re-
producibility of tissue-specific myeloid state
proportions. Moreover, related tissues—such 
as muscle (heart, esophagus muscularis, skel- 
etal muscle) or epithelial barriers (esophagus 
mucosa, skin)—grouped by their myeloid com-
position profiles (fig. S15H). Macrophage types
and proportions varied by tissue, with breast, 
esophagus mucosa, esophagus muscularis, 
heart, and skeletal muscle having signifi-
cantly higher proportions of MΦ LYVE1^high
macrophages [P < 10~°, Dirichlet regression 
likelihood ratio test (LRT); (28)] and lung and 
prostate having significantly higher propor-
tions of MΦ HLAII^high [P < 10-°, Dirichlet re-
gression LRT; (28)] (Fig. 3C and fig. S15, F and I).


A dichotomy between LYVE1- and 
HLAII-expressing macrophages is preserved
across tissues 


Two expression states of LYVE1^high and HLAII^high
macrophage populations were dichotomous
(either LYVE1^high HLAII^low or LYVE1^low HLAII^high)
and represented the end points of two alterna-
tive branches. Specifically, low-dimensional
representation of monocytes, macrophages,
and Mo/MΦ populations as a continuum with
diffusion maps captures the HLAII^high and
LYVE1^high cells as “terminal” points in two
branches that emanate from CD14+ monocytes
at the root. Each of the two terminals is pre-
ceded by distinct earlier putative transitional
states: a Mo/MΦ FCGR3A^low state between
CD14+ monocytes and the LYVE1^high HLAII^low
population and a Mo/MΦ FCGR3A^high state
between the monocytes and the LYVE1^low
HLAII^high cells (Fig. 3D and fig. S16, A and B).
(There is also a putative secondary path be-
tween FCGR3A^low and LYVE1^low HLAII^high cells
through a FCGR3A^high intermediate.) The po-
sition of the FCGR3A™* transitional state is 
consistent with lung data from a humanized 
mouse model (54, 55). These key features are 
consistent overall in the map that is con- 
structed only from macrophage nuclei from a 
single tissue (fig. S16A). Thus, FCGR3A^low and
FCGR3A^high states might be Mo/MΦ popu-
lations that are less-differentiated or less-
activated states of LYVE1^high and HLAII^high
MΦs, respectively.

Each of the LYVE1^high and HLAII^high subsets
expressed a combination of a common signa-
ture and tissue-specific markers (Fig. 3E, fig.
S16C, and table S5) and was enriched for dis-
tinct functions, mirroring those of Lyve1^hi
MHCII^lo and Lyve1^lo MHCII^hi resident mac-
rophage populations in mouse tissues (47)
(fig. S16, C and E). HLAII^high cells were en-
riched for immune-related processes and
expressed higher levels of complement com-
ponents APOE and C1QA, C1QB, and C1QC
(Fig. 3B and fig. S16, C and D). Genes differ-
entially expressed between human HLAII^high
and LYVE1^high subsets corresponded to those
in murine counterparts (fig. S16E), but with
higher expression in human HLAII^high macro-
phages of C1QB and C1QC complement genes
in lung and phagocytic receptors MARCO and
CD36 in heart (fig. S16E). HLAII^high macrophages
were also enriched for immune interactions
with B cells, DCs, mast cells, natural killer (NK)
cells, and T cells (fig. S16G). LYVE1^high profiles
were enriched for tissue-supporting mod-
ules and had putative receptor-ligand inter-
actions (28) with lymphatic endothelial cells,
fibroblasts, adipocytes, and myocytes (fig. S16F).
In mice, MΦ Lyve1^hi cells were located near
blood vessels (47) and regulated vascular tone
(56), and cardiac LYVE1+ macrophages have
been implicated in regulating the lymphatic
network (57). Thus, LYVE1^high macrophages
may have a homeostatic role in the human
heart, lung, and esophagus.


LAM-like macrophages are prevalent across 
human tissues and share a regulatory program 


In our atlas, we identified LAM-like cells as 
widely distributed across healthy human tis- 
sues, with the vast majority of LAMs (97%, 
268 of 283) from the breast, heart, lung, and 
prostate (Fig. 3C). LAMs and LAM-like cells 
have been previously reported in disease con-
texts in adipose tissue from obese humans and
mice (51), injured and fibrotic liver (58-60),
obese liver (61), fibrotic lung (50, 62), athero-
sclerotic aortic tissue (63, 64), leprosy (65), and
the brains of individuals with Alzheimer’s dis- 
ease (66-68). However, an understanding of 
their distribution and heterogeneity across 
human tissues is still lacking. 

To characterize LAMs across the body, we 
analyzed LAM-like cells in the expanded con- 
text of our study and 17 other published atlases 
spanning 14 tissues. We trained a linear classi-
fier with published omental adipose scRNA-
seq containing LAMs (28, 51) and classified
each myeloid profile in our dataset and the 
published compendium as LAM-like macro- 
phages, non-LAM macrophages, and non- 
macrophages. From this, we recovered 283 
LAM-like cells in our study and 4285 LAM- 
like cells in the 17 published studies (Fig. 3, 
F to H; fig. S17, A to D; and table S6). 

LAM-like cells were present among tissue- 
resident macrophages across a broad range of 
tissues and pathologies. These included adi-
pose (51, 69) and atherosclerotic (70, 71) tissue,
as reported, as well as healthy tissues [pla- 
centa (72), testis (73), kidney (74), pancreas 
(75), prostate (38), decidua (72), liver (76), 
ovary (76, 77), skeletal muscle (78), and intes- 
tine (26)] and other disease contexts [acne 
(79), leprosy (79) and atopic dermatitis (80, 81)
(skin), and Crohn’s disease (ileum) (82)]. Mi- 
croglia from the central nervous system of 
epileptic patients (83) were also classified as 
LAM-like cells, indicating that microglia that 
express LAM signature genes extend beyond 
Alzheimer’s disease (66-68). LAM signature 
genes were enriched for genome-wide asso- 
ciation study (GWAS) genes associated with 
levels of high-density lipoprotein (HDL) and 
low-density lipoprotein (LDL) cholesterol, tri- 
glycerides, type 2 diabetes, and the tau to 
Aβ1-42 ratio in cerebrospinal fluid (fig. S17H
and table S7), further supporting their role in 
lipid homeostasis. 

Although a core set of signature LAM genes 
was expressed across most tissues and studies, 
many other genes varied across tissues. For 
example, both CHIT1 and CTSK were highly
expressed in LAMs from leprosy skin samples 
(but had very low expression in other skin 
LAMs) (Fig. 3H and fig. S17F), in line with 
higher serum chitotriosidase activity in leprosy 
patients (84). Select lipid pathway genes, in- 
cluding fatty-acid binding protein FABP4, 
lipoprotein lipase LPL, and phagocytic lipid 
receptor CD36, were highly expressed in LAMs 
from tissues with high adipose content, in- 
cluding adipose, atherosclerotic lesions, and 
intestine creeping fat samples, possibly reflect- 
ing increased lipid-induced transcriptional 
stimulation of these target genes under these 
conditions (85). LAMs in creeping fat from 
Fig. 3. A dichotomy between LYVE1- and HLAII-expressing macrophages and
LAM-like populations across tissues. (A) Myeloid profiles (dots), colored by cell type
and state and overlaid with a PAGA graph of myeloid states (large nodes). (B) Expression
of marker genes (columns) associated with each subset (rows). (C) Myeloid cell
distribution across tissues. The overall proportion of myeloid cell subsets (colors) in each
tissue (bars) is shown at the top, and the overall proportion of cells from each tissue in
each subset (bars) is shown at the bottom. (D) LYVE1^high and HLAII^high macrophages are
end points of two differentiation trajectories. A diffusion map of monocytes, macrophages, 
and transitional subsets (colors) is shown. Large circles represent centroids (sizes are 
proportional to population size). (E) Cross-tissue and tissue-specific markers. Expression 
of marker genes (columns) associated with two myeloid subsets (left) in each tissue 
Crohn’s intestine and atherosclerotic lesions
were additionally characterized by higher
expression of interleukin IL1B and multiple
chemokines (CXCL8, CXCL3, CCL4), possibly 
reflecting the inflammatory environment under 
these conditions (fig. S17F). 

We predicted transcription factors (TFs) that 
could mediate LAM-like gene expression by 
inferring TF activity from target expression (28) 
and ranking TFs by the mean difference be- 
tween their activities in LAMs versus non-LAM 
macrophages [(28); Fig. 3I]. LAM-associated
TFs inferred across all classified LAM-like cells
included PPARG, USF1, and NR1H3 (LXRA)
(Fig. 3J and fig. S17G), suggesting a shared 
core regulatory mechanism. These are major 
regulators of lipid metabolism-related expres- 
sion (86) and have been proposed to regulate 
Trem2 expression in mice (87). 
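
The ranking step described above can be sketched in a few lines of Python, assuming that a per-cell activity score has already been inferred for each TF (the inference itself is described in (28) and is not reproduced here). The function name, the placeholder TF labels, and the scores are hypothetical.

```python
from statistics import mean


def rank_tfs(activity, is_lam):
    """Rank TFs by mean activity in LAM-like cells minus mean activity elsewhere.

    activity : dict mapping TF name to a list of per-cell activity scores
    is_lam   : list of booleans, True where the cell was classified as LAM-like
    """
    differences = {}
    for tf, scores in activity.items():
        lam = [s for s, flag in zip(scores, is_lam) if flag]
        other = [s for s, flag in zip(scores, is_lam) if not flag]
        differences[tf] = mean(lam) - mean(other)
    # Largest positive difference first.
    return sorted(differences.items(), key=lambda item: item[1], reverse=True)


# Made-up activity scores for four macrophages (two LAM-like, two not).
activity = {
    "TF_A": [2.0, 1.8, 0.2, 0.1],
    "TF_B": [0.5, 0.4, 0.6, 0.5],
    "TF_C": [0.1, 0.3, 1.0, 1.2],
}
is_lam = [True, True, False, False]
ranked = rank_tfs(activity, is_lam)
print(ranked)
```

A plain mean difference is easy to interpret but is sensitive to the scale of each TF's activity score, so in practice scores would need to be comparable across TFs before ranking.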


Shared and tissue-specific features 
of fibroblasts 


To characterize fibroblast heterogeneity (88), 
we analyzed 32,421 fibroblast nuclei profiles 
across the eight profiled tissues (fig. S18A), 
identifying shared and tissue-specific signa- 
tures. The cross-tissue, shared fibroblast ex- 
pression program consisted of markers that 
were significantly more highly expressed in 
fibroblasts than in nonfibroblast cell types 
within each tissue (FDR < 0.05, Welch’s t test)
and included multiple extracellular matrix 
(ECM) constituents (Fig. 4A and table S8). 
Conversely, the tissue-enriched fibroblast sig- 
natures were defined based on genes exclusive 
to or highly enriched in fibroblasts from a 
given tissue versus fibroblasts from all other 
tissues (Fig. 4, B to D, and fig. S18B). 
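
For readers less familiar with the statistics named above, the following Python sketch computes Welch's t statistic with its Welch-Satterthwaite degrees of freedom for one gene, and applies the Benjamini-Hochberg adjustment to a list of P values. It is a generic illustration rather than the pipeline used in the study; the function names and toy values are hypothetical, and converting t to a P value (which needs the t distribution) is left out.

```python
from statistics import mean, variance


def welch_t(a, b):
    """Welch's t statistic and Welch-Satterthwaite degrees of freedom."""
    va = variance(a) / len(a)  # squared standard error of group a
    vb = variance(b) / len(b)  # squared standard error of group b
    t = (mean(a) - mean(b)) / (va + vb) ** 0.5
    df = (va + vb) ** 2 / (va ** 2 / (len(a) - 1) + vb ** 2 / (len(b) - 1))
    return t, df


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy expression of one gene in fibroblasts versus other cells.
t_stat, dof = welch_t([1, 2, 3, 4, 5], [2, 4, 6, 8, 10])
fdr = benjamini_hochberg([0.01, 0.04, 0.03, 0.005])
print(t_stat, dof, fdr)
```

Welch's variant does not assume equal variances in the two groups, which suits comparisons between a single cell type and a heterogeneous pool of all other cells.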

Tissue-enriched fibroblast features were con- 
sistent with the specific functions and inter- 
actions required in the respective tissues. For 
example, the esophagus mucosa fibroblast sig- 
nature (table S8) included genes involved in 
neuron and axon development (e.g., NTN1,
PLXNB1, FGF13), suggesting interactions with
the enteric nervous system, possibly through
NTN1-DCC and NTN1-UNC5C interactions
(fig. S18E). The cardiac fibroblast signature
genes included TFs involved in cardiac devel- 
opment (e.g., GATA4 and GATA6), revealing 
that expression of these developmental TFs is 
retained in the adult cardiac fibroblast com- 
partment (89-92). The skeletal muscle signa-
ture showed increased expression of the CXCL14
and CXCL12 chemokines and of components
of the renin-angiotensin-aldosterone system
(AGTR1 and MME), which regulate skeletal
muscle mass (93, 94) [reminiscent of the ex-
pression of the same components by WNT2B+
fibroblasts in a local renin-angiotensin system
in the colon (16)].

Lung fibroblast signatures were enriched 
for ECM, cation transport, and contractile func- 
tions (Fig. 4E), including multiple components 
of the basement membrane (BM) that are re-
quired for epithelial-mesenchymal adhesion
[nephronectin (NPNT), FRAS1, hemicentin-1
(HMCN1), and integrin ITGA8 (Fig. 4, F and
G), which form a protein complex (95-98) that
anchors NPNT to the BM (99-103)]. Mutations
in ITGA8, FRAS1, and HMCN1 have been linked
to Fraser syndrome, a congenital disorder 
that affects cell adhesion and results in skin, 
kidney, lung, and craniofacial abnormalities 
(104-107), and common variants in NPNT are 
significantly associated with chronic obstruc- 
tive pulmonary disease (COPD) and forced 
expiratory volume (FEV) in GWASs (Fig. 4H). 
Granular annotation of the lung fibroblast pro-
files (40, 108) (fig. S18C) suggests that the
adhesion complex is expressed by alveolar fi-
broblasts (Fig. 4F and fig. S18D). Alveolar fibro-
blasts also specifically expressed FGFR4 (Fig.
4B), which is essential for alveologenesis in
mice (109) and is genetically linked with bron-
chopulmonary dysplasia that affects alveoli in
humans (110).

Of all tissue fibroblasts, lung alveolar fibro- 
blasts distinctively expressed a calcium sig- 
naling and actomyosin contractility program 
(Fig. 4, E and F, and fig. S18D), including the 
mechanosensitive calcium ion channel PIEZO2 
(Fig. 4G), which has been proposed to sense 
pulmonary stretch (111, 112); multiple calcium
channels and adrenergic and purinergic recep-
tors, which have been implicated in stimulat-
ing calcium release from intracellular stores
(CACNA1D, TRPC6, MCOLN2/TRPML2); and
myosin light chain kinase (MYLK), which is
involved in mediating calcium-induced acto-
myosin contraction (113). This suggests that
alveolar fibroblasts could integrate mechani- 
cal stretch and forces (through PIEZO2) and 
neuronal excitatory signals (40), which could 
affect their migratory or mechanical proper- 
ties. Notably, although alveolar fibroblasts 
express myofibroblast markers (e.g., ACTA2; 
fig. S18D), they are distinct from previously
reported myofibroblasts (40), and our data 
suggest that they constitute a distinctive con- 
tractile and excitable fibroblast state. 


Intra- and cross-tissue cell-type associations 
with monogenic disorders 


Human genetics has identified numerous rare 
monogenic disease genes, and many have been 
experimentally mapped to cell type(s) of action 
(114, 115). To characterize the expression 
of monogenic disease genes across cell types, 
we related disease genes associated with dif- 
ferent phenotypes or disease categories in the 
Online Mendelian Inheritance in Man (OMIM) 
database (116) to the cell populations in which 
they are expressed in our cross-tissue atlas. 
Because OMIM entries are not organized by 
disease categories, we leveraged topic modeling 
to aggregate 5812 genotype-phenotype 
associations based on similarities in text 
descriptions of clinical features, resulting in 
229 distinct disease topics [(28); figs. S19 and 
S20]. We then related topics to a cell type based 
on the enriched expression of the topic genes 
in the cell type (Fig. 5A, figs. S21 and S22, and 
table S9). 

Many topics mapped to their expected cell 
populations. For example, cardiac disease topics 
(topics 65 and 66) mapped to cardiac myonuclei 
and/or endothelial cells, immune and infec- 
tion topics (topics 132, 205, and 217) mapped 
to immune cells across tissues, and a diabetes 
and lipodystrophy topic (topic 222) mapped to 
adipocytes in skeletal muscle and skin (Fig. 5A 
and fig. S23A). The male infertility and sperm 
motility topic (topic 129) was associated with 
ciliated lung cells and stromal cells (fig. S23B). 
The link between sperm dysmotility and bronchitis 
is well established, with genes common 
to flagella and cilia perturbed across both 
sperm and lung (117); we further detect this 
specific link in topic 155, which associates 
ciliary dyskinesias and the lung ciliated epithelium 
(fig. S23C). 


Genes from monogenic muscle disease groups 
are enriched in distinct subsets of myocyte and 
nonmyocyte nuclei in three muscle types 


Among the monogenic disorders, muscle disease 
phenotypes are a well-characterized subset 
and are known to arise from mutations in 
genes that are expressed in myocytes (e.g., 
structural genes involved in contraction) and/or 
other cells in the surrounding tissue (e.g., 
NMJ, ECM, and adipose tissue) (118-121). We 
leveraged the three muscle types represented 
in our atlas—cardiac, skeletal, and smooth 
muscle—to map 605 well-curated monogenic 
muscle disease genes (118) (table S10), recover- 
ing known biology and extending hypotheses 
beyond those obtained from bulk-tissue RNA-seq 
(122, 123). We tested disease groups (e.g., hered- 
itary cardiomyopathies, motor neuron diseases) 
for their enrichment with cell type-specific 
markers across our muscle tissues (FDR < 0.1) 
[(28); Fig. 5B, fig. S24, and table S11]. 
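
As a rough illustration of the kind of test described above, the following sketch checks whether a disease gene set overlaps a cell type's marker set more than expected by chance (one-sided Fisher's exact test) and then adjusts the P values with the Benjamini-Hochberg procedure. The gene sets, the background size of 20,000 genes, and all function names are hypothetical choices made for this example; the authors' actual procedure is described in (28) and may differ.

```python
from math import comb


def fisher_enrichment_p(overlap, n_markers, n_disease, n_background):
    """One-sided Fisher's exact test, computed as a hypergeometric upper tail.

    Probability of observing at least `overlap` shared genes when a disease
    set of size `n_disease` is drawn from `n_background` genes, of which
    `n_markers` are markers of the cell type.
    """
    total = comb(n_background, n_disease)
    tail = sum(
        comb(n_markers, k) * comb(n_background - n_markers, n_disease - k)
        for k in range(overlap, min(n_markers, n_disease) + 1)
    )
    return tail / total


def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down, enforcing monotonicity.
    for rank in range(m, 0, -1):
        i = order[rank - 1]
        running_min = min(running_min, pvalues[i] * m / rank)
        adjusted[i] = running_min
    return adjusted


# Toy inputs (assumptions for illustration, not data from the atlas).
markers = {
    "Myonuclei (NMJ-localized)": {"CHRNE", "CHRNA1", "CHRND", "MUSK", "COL13A1"},
    "Adipocyte": {"ADIPOQ", "PLIN1", "LPL"},
}
disease_genes = {"CHRNE", "CHRNA1", "CHRND", "MUSK", "DOK7"}
n_background = 20000

cell_types = list(markers)
pvals = [
    fisher_enrichment_p(
        len(markers[c] & disease_genes), len(markers[c]),
        len(disease_genes), n_background,
    )
    for c in cell_types
]
fdr = benjamini_hochberg(pvals)
# Cell types passing the FDR < 0.1 threshold quoted in the text.
significant = [c for c, q in zip(cell_types, fdr) if q < 0.1]
```

In this toy case only the NMJ-localized myonuclei pass the threshold, because four of the five hypothetical disease genes are among their markers.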

As expected, different disease gene sets were 
associated with different myonuclei subsets 
(113 of 605 genes; table S11) in patterns that 
recapitulated known disease mechanisms 
(Fig. 5B). For example, skeletal muscle myo- 
nuclei were associated with congenital myop- 
athy genes (FDR = 7.07 x 10°), and cardiac 
myonuclei were associated with hereditary 
cardiomyopathy genes (FDR = 4.24 x 107”). 
Some associations highlighted finer myonuclei 
subsets. For example, genes linked to congen- 
ital myasthenic syndrome, a disorder affecting 
neuromuscular transmission (124), were spe- 
cifically expressed in NMJ-localized myonuclei 
but not in other skeletal myonuclei (tables S2 
and S12). These included acetylcholine receptor 
subunits (CHRNE, CHRNA1, CHRND), the 
NMJ-organizing receptor tyrosine kinase MUSK, 
and NMJ-enriched ECM components (COL13A1, 
LAMA2). 


Fig. 4. Shared and tissue-specific fibroblast features. (A and B) Expression 
in each tissue subset (rows) of marker genes (columns) distinguishing fibroblasts 
from nonfibroblasts across all tissues (A) or enriched in fibroblasts in one versus 
other tissues (B). (C and D) Fibroblast profiles (dots) colored by tissue (C) or 
expression of the most exclusive marker (D). (E) Significance [-log10(FDR), 
x axis] of gene sets (y axis) enriched (FDR < 5%) in genes covarying with the 
lung-specific fibroblast signature. (F) Expression of ECM and cation transport genes 
(columns) in the covarying gene module in each granular fibroblast subtype in 
each tissue (rows). (G) ITGA8 and PIEZO2 (columns) expression in granular cell types 
(rows) in lung. (H) Significance (x axis) and Open Targets Genetics locus-to-gene 
score (color) of the most significant variants mapped to NPNT with a high (>0.5) locus- 
to-gene score in GWASs (y axis). FEV, forced expiratory volume. 


Eraslan et al., Science 376, eabl4290 (2022) 13 May 2022 


Other disease gene sets were associated with 
nonmyocyte accessory cells, including neurons, 
Schwann cells, fibroblasts, and adipocytes 
(127 genes; table S12), often mirroring 
clinical features, such as nervous system cells 
in neuropathies or adipocytes in metabolic 
myopathies. In particular, Schwann cells were 
associated with hereditary motor and sensory 
neuropathies in all three tissues (FDR = 0.015 to 
0.06), but their association with Dejerine-Sottas 


hypertrophic neuropathy (a subtype of Charcot-Marie-Tooth 
disease) was specific to skeletal 
muscle (FDR = 2.28 x 107°), consistent with 
the need to maintain innervation to prevent 
muscle atrophy (125). Thus, distinctive cell 
expression patterns can provide insight into 
onset (Dejerine-Sottas is an early-onset disease), 
severity, and the predominant peripheral 
nerves that are affected (i.e., motor, sensory, 
or autonomic), where heart can possibly 
act as a proxy for autonomic nerves. In other 
examples, adipocytes from esophagus muscularis 
were associated with metabolic myopathies 
related to lipid metabolism (FDR = 
0.0003); the congenital muscular dystrophies 
genes COL6A1, COL6A2, and COL6A3 are expressed 
in fibroblasts, consistent with joint laxity 
and progressive contractures along with 
muscle weakness (126); and the recessive limb 
girdle muscular dystrophies gene DYSF is expressed 
in immune cells, consistent with the role 
of more aggressive monocytes in disease progression 
(127). Some of the enrichments in nonmyocytes 
are also present in the same cell types 
in other nonmuscle tissues (e.g., breast adipocytes 
for metabolic myopathies or breast and 
skin pericytes for cardiomyopathies; fig. S25I), 
highlighting that tissue-specific pathology 
may arise from the relation between an accessory 
cell’s broader function and specialized 
tissue demand. 


Fig. 5. Monogenic muscle disease genes related to cell types and interactions 
across cardiac, skeletal, and smooth muscle tissues. (A) Enrichment of 
monogenic disease groups to broad cell types. Effect size (log odds ratio, 
dot color) and significance [-log10(FDR), dot size] of enrichment of genes 
from disease topics [rows; (28)] in broad cell-type markers in each tissue 
(columns) are shown. A red outline indicates an FDR less than 0.1. Topic names 
consist of the topic identifier and five words with the highest loadings. Red stars 
indicate highlighted topics. (B) Relation of broad cell types to monogenic 
muscle disease groups. Effect size and significance of enrichment of genes from 
monogenic muscle disease groups (rows) for broad cell type markers in each 
tissue (columns) are shown. A red outline indicates an FDR less than 0.1. Color 
shading indicates disease groups associated with only nonmyocytes (green), 
only myonuclei (yellow), or both (light purple). (C and D) DMD expression in 
human (C) and mouse (D) muscle. Cell types (x axis) are ordered, left to right, 
such that the cell types that are shared between human and mouse within a 
tissue are presented first and species-specific cell types follow. (E and 
F) Putative cell-cell interactions in muscle implicating muscle disease genes. 
Shown are cell types (inner color) from muscle tissues (outer color) connected 
by putative interactions (dotted edges) between a receptor (left square) 
expressed in one cell type and a ligand (right square) expressed in the other in 
interactions involving myocytes (E) or only nonmyocytes (F). Black and 
gray connecting lines between cell types and genes indicate high and low 
expression, respectively. Bold formatting indicates a muscle disease gene. 
(G) Diseases highlighted in (E) and (F). ALS, amyotrophic lateral sclerosis; AD, 
autosomal dominant; AR, autosomal recessive; CMT, Charcot-Marie-Tooth 
disease; XR, X-linked recessive. 

There were differences in the expression of 
monogenic muscle disease genes between slow 
and fast myocyte subsets for both classical and 
cytoplasmic myonuclei (fig. S9). As expected, 
slow myocytes preferentially expressed type 
1 fiber markers (36, 128) and disease genes 
MYH7 (FDR < 10” in cytoplasmic myonuclei) 
and PPARGC1A (FDR < 10°” in regular myonuclei). 
Fast myocytes specifically expressed 
the type 2 fiber marker MYH2 (FDR < 10°” in 
cytoplasmic myonuclei), in line with respec- 
tive clinical myosinopathy phenotypes (129). 
Congenital myopathy disease genes, which 
include genes known to be active in slow and 
fast myocytes, were enriched in both groups. 
The myotonic dystrophy (MD) type 2 gene 
CNBP was enriched in regular myonuclei from 
fast myocytes [FDR < 10°”; (28); fig. S9, E 
and F], which are preferentially affected in 
MD type 2 (130, 131). By contrast, the MD type 1 
disease gene DMPK was preferentially expressed 
in cytoplasmic myonuclei from slow 
myocytes (P < 0.036, Welch’s t test), which are 
perturbed in MD type 1 (132). Regular myonuclei 
subsets from slow myocytes were also 
enriched in TPM3 [FDR < 10°”; (28); fig. S9, E 
and F], which has been linked to congenital 
fiber type disproportion, a condition charac- 
terized by smaller slow myocyte fibers (133). 
Metabolic myopathy disease genes (GBE1, 
ETFDH, and SLC25A20) were also enriched in 
slow myocyte markers (P < 0.05, Fisher’s exact 
test; fig. S9, E and F). 

Many of the cell-type associations for monogenic 
muscle disease genes were conserved 
between our human muscle atlas and corresponding 
mouse snRNA-seq data (26, 28) (fig. 
S25), but there were also notable differences. 
In both mouse and human, there were signi- 
ficant associations between skeletal muscle 
myonuclei and various dystrophies and myo- 
pathies (FDR < 0.1, Fisher’s exact test), between 
cardiac myonuclei and cardiomyopathies (FDR < 
0.1), between adipocytes (in skeletal muscle, 
esophagus, and heart) and metabolic myopa- 
thies (FDR < 0.1), and between Schwann cells 
in skeletal muscle and hereditary motor and 
sensory neuropathies (FDR < 0.1) (fig. S25H). 
However, dystrophin (DMD) expression across 
accessory cell types varied between human 
and mouse. In humans, high levels of DMD 
expression (comparable to that in myonuclei) 
were observed in adipocytes in all muscle 
types; pericytes and Schwann cells in skeletal 
muscle and esophagus muscularis; and enteric 
neurons in esophagus muscularis [mean ex- 
pression log(TP10K+1) > 2.0; Fig. 5C]. Lower, 
intermediate levels of DMD expression were 
observed in skeletal muscle fibroblasts and 
satellite cells, as well as esophagus muscula- 
ris ICCs [mean expression log(TP10K+1) < 2.0; 
Fig. 5C]. DMD expression in adipocytes supports 
the possibility of local metabolic perturbation 
(134) (Fig. 5C), whereas its expression 
in the enteric nervous system and ICCs raises 
the possibility that perturbation of these cells 
contributes to the gastrointestinal dysfunction 
phenotype in Duchenne muscular dystrophy 
(135). In mouse, whereas Dmd expression is 
high in myonuclei, pericytes, satellite cells, 
and Schwann cells, it is low in adipocytes and 
fibroblasts across all three muscle tissue types 
[mean expression log(TP10K+1) < 1.0; Fig. 5D]. 
These differences suggest that the effects of DMD 
mutation on accessory cells and their contri- 
bution to disease (135, 136) may differ between 
human patients and mouse disease models (137). 
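
The expression thresholds quoted above are given in units of log(TP10K+1). A minimal sketch of that normalization follows. It assumes, which the text does not state, that TP10K means counts scaled to 10,000 transcripts per nucleus, that the natural logarithm is used, and that the mean is taken over the log-transformed values. The counts, library sizes, and function names are invented for illustration.

```python
from math import log1p


def log_tp10k(gene_count, total_counts):
    """log(TP10K + 1) for one gene in one nucleus (natural log assumed)."""
    tp10k = 1e4 * gene_count / total_counts
    return log1p(tp10k)


def mean_expression(gene_counts, totals):
    """Mean log(TP10K+1) of one gene across the nuclei of a cell type."""
    values = [log_tp10k(c, t) for c, t in zip(gene_counts, totals)]
    return sum(values) / len(values)


# Hypothetical DMD counts in three adipocyte nuclei and their library sizes.
dmd_counts = [4, 9, 0]
library_sizes = [2500, 3000, 1800]

mean_dmd = mean_expression(dmd_counts, library_sizes)
# 2.0 is the cutoff the text uses to separate high from intermediate expression.
is_high = mean_dmd > 2.0
```

Under these assumptions a nucleus with zero counts contributes 0 to the mean, so sparse detection pulls the cell-type mean down even when the expressing nuclei have high values.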


Cell-composition and cell-intrinsic secondary 
effects in muscle disease tissue 


Muscle dystrophies commonly display a sec- 
ondary shift in cellular composition, with fi- 
brotic or adipogenic replacement of muscle 
tissue (138), which can introduce secondary 
pathologic processes that exacerbate muscle 
loss (139). To characterize the cellular expres- 
sion patterns of disease genes whose bulk- 
tissue expression levels are altered in muscle 
diseases, we analyzed the cell-type specificity 
of genes up-regulated in bulk RNA-seq data 
from muscle tissues from 43 patients with rare 
muscle disorders (122) compared with healthy 
muscle tissues (140) (fig. S26). Some of the cell 
type-specific expression patterns reflected 
known biology, including up-regulation of 
hallmark fibroblast, adipocyte, and immune 
genes, likely because of changes in cell proportions 
(e.g., COL6A2, ADIPOQ, and HLA-A, 
respectively). Other patterns suggest additional, 
cell-intrinsic regulatory events (beyond cell 
composition changes) that may modify dis- 
ease progression. For example, dermatopon- 
tin (DPT), an adipokine that promotes ECM 
remodeling and inflammation (141), is ex- 
pressed in adipocytes but is not correlated with 
ADIPOQ expression across adipocytes from dif- 
ferent tissues. This suggests that its expression 
is regulated and is not merely reflecting changes 
in cell composition. Similarly, ELK3, which is 
expressed in endothelial cells, suppresses angi- 
ogenesis (142) and may contribute to functional 
muscle ischemia in muscle disease (143), which 
is otherwise typically attributed to nitric oxide 
signaling (136). Thus, the increased resolution 
of a single-cell atlas can help disentangle sec- 
ondary effects related to cell-composition and cell- 
regulatory events in accessory cells during disease. 


Disease genes may affect 
receptor-ligand interactions 


Some disease genes encode receptors or lig- 
ands that participate in cell-cell interactions, 
such that loss-of-function mutations can affect 
tissue function through non-cell-autonomous 
effects mediated by these cell-cell interactions. We 
related cell types in muscle tissue to one 
another through receptor-ligand interactions 
(6, 26, 144) that included at least one mono- 
genic disease gene (28) in every tissue in our 
atlas (table S13). 

Our analysis suggests that mutations in some 
disease-causing genes may disrupt interactions 
between myocytes and other cell types. For 
example, mutations in ERBB3 (the disease gene 
for lethal congenital contracture syndrome) 
may disrupt interactions between myocytes 
and Schwann cells (Fig. 5E and table S13) and 
contribute to joint contractures as an associated, 
but not primary, disease phenotype (145). 
Mutations in DAG1 (the disease gene for congenital 
muscular dystrophy), whose product is known 
to interact primarily with laminin produced 
by fibroblasts (146, 147), may additionally disrupt 
interactions with immune cells through 
LGALS9 and alter their function (148). Putative 
cell-cell interactions involving only nonmyocytes 
included the disease genes L1CAM 
(MASA syndrome), MET (arthrogryposis and 
muscular dysplasia), and NGF (hereditary sen- 
sory and autonomic neuropathy), each poten- 
tially affecting multiple cell pairs, including 
neurons and Schwann, satellite, immune, and 
stromal cells (Fig. 5, F and G, and table S13). 


Cell type-specific enrichment of QTL genes 
mapped to GWAS loci 


Single-cell atlases can also provide insights into 
cell-type specificity and mechanisms of action 
of the genes in disease-associated loci identified 
by GWASs. Studies associating genetic variants 
with changes in gene expression or splicing 
(quantitative trait loci for expression or splicing; 
eQTL or sQTL, respectively) 
showed tissue-specific colocalization with 
multiple loci from GWASs of human traits, 
including disease risk (7, 149-151), but lacked 
cellular resolution (11). To prioritize cell types 
of action and causal genes for complex diseases 
and traits in specific cells and tissues, we used 
ECLIPSER (28, 152) to test whether GWAS loci 
from 21 complex traits (table S14), with likely ef- 
fects in at least one of the eight tissues analyzed, 
are enriched for genes with high cell type- 
specific expression in each tissue. We defined 
putative causal genes for each GWAS locus 
as the set of genes whose eQTLs and sQTLs 
(7) were in linkage disequilibrium [squared 
genotype correlation (r²) > 0.8] with the lead 
GWAS variant(s) [Fig. 6A; (28)]. We further in- 
cluded genes prioritized by additional genom- 
ic data [e.g., Hi-C and protein QTLs (pQTLs)] 
and linkage to predicted deleterious protein- 
coding variants (153, 154). Because more than 
one gene typically maps to a GWAS locus using 
this approach (mean = 2, and maximum = 37 
for selected traits and 170 for null traits), we 
scored loci by the fraction of cell type-specific 
genes in the locus [Fig. 6A; (28)]. We assessed 
enrichment for each GWAS locus set against 
a null distribution of GWAS loci associated 
with tissue-unrelated traits using a Fisher’s 
exact test [Fig. 6A; (28)]. 
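
The locus-scoring idea in this paragraph can be sketched as follows. This is not the ECLIPSER implementation: the cutoff rule (calling a locus "high-scoring" when its score is nonzero and reaches the 95th percentile of the null scores), the plain one-sided Fisher's exact test, the placeholder gene names (GENE_A, NULL_0, and so on), and every function name are assumptions made for this example.

```python
from math import comb


def locus_score(locus_genes, cell_type_genes):
    """Fraction of the genes mapped to a locus that are cell type-specific."""
    if not locus_genes:
        return 0.0
    return len(locus_genes & cell_type_genes) / len(locus_genes)


def fisher_greater(a, b, c, d):
    """One-sided Fisher's exact P value for the 2x2 table [[a, b], [c, d]]."""
    row1, col1, n = a + b, a + c, a + b + c + d
    tail = sum(
        comb(row1, k) * comb(n - row1, col1 - k)
        for k in range(a, min(row1, col1) + 1)
    )
    return tail / comb(n, col1)


def locus_set_enrichment(trait_loci, null_loci, cell_type_genes, percentile=95):
    """Compare high-scoring loci for a trait against loci from null traits."""
    trait_scores = [locus_score(g, cell_type_genes) for g in trait_loci]
    null_scores = sorted(locus_score(g, cell_type_genes) for g in null_loci)
    # Assumed cutoff: the given percentile of the null score distribution.
    idx = min(len(null_scores) - 1, percentile * len(null_scores) // 100)
    cutoff = null_scores[idx]

    def is_high(score):
        return score > 0 and score >= cutoff

    a = sum(is_high(s) for s in trait_scores)   # trait loci, high score
    c = sum(is_high(s) for s in null_scores)    # null loci, high score
    b = len(trait_scores) - a
    d = len(null_scores) - c
    return fisher_greater(a, b, c, d)


# Toy example: each locus is the set of genes mapped to it through QTLs.
myonuclei_genes = {"CASQ2", "MYH6", "MYH7"}
trait_loci = [
    {"MYH6", "MYH7"},
    {"CASQ2", "GENE_A"},
    {"GENE_B"},
    {"GENE_C", "GENE_D", "GENE_E"},
]
null_loci = [{f"NULL_{i}"} for i in range(19)] + [{"MYH7", "NULL_19"}]

p_value = locus_set_enrichment(trait_loci, null_loci, myonuclei_genes)
```

Scoring by fraction rather than by count keeps loci with many mapped genes from dominating the comparison, which matters here because the number of genes per locus varies widely.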

Seventeen of the traits were enriched in 
both expected and previously undescribed cell 
types at a tissue-wide FDR less than 0.05 
(Benjamini-Hochberg), 16 of which were sig- 
nificant (FDR < 0.05) across tissues (Fig. 6B, 
figs. S27 and S28, and tables S15 and S16). 
Among the expected associations are skin pig- 
mentation traits in melanocytes, autoimmune 
and inflammatory diseases in T and NK cells, 
COPD in lung fibroblasts, prostate cancer in 
luminal epithelial cells, atrial fibrillation and 
heart rate in myonuclei, and heart rate in 
lymphatic endothelial cells (155) (Fig. 6B). 
Type 2 diabetes loci were enriched in skeletal 
muscle adipocytes and in lymphatic endothe- 
lial cells in multiple tissues, which might con- 
tribute to the predisposition of type 2 diabetes 
to vascular disease (156, 157) (Fig. 6B). Less 
well-characterized cell type-trait associations 
included DCs (in almost all tissues) with non- 
melanoma skin cancer (158) and adipocytes 
(breast) with atrial fibrillation (table S15). 
GWAS loci enriched in a specific cell type from 
a known tissue of action frequently showed 
similar enrichment in the same cell type from 
other uninvolved tissues. For example, atrial 
fibrillation GWAS loci were enriched in myo- 
nuclei in heart, skeletal muscle, esophagus mus- 
cularis, and prostate (Fig. 6, B and C); coronary 
artery disease and heart rate loci were enriched 
in pericytes in five or six tissues in addition to 
heart; and prostate cancer loci were enriched 
in luminal epithelial cells in both prostate and 
breast (Fig. 6B and figs. S29 and S30). 

Cell-type enrichment helped identify putative 
causal genes in GWAS loci with multiple 
QTL-mapped genes (tables S15 and S16). On 
average, about two-thirds of genes driving the 
cell type-specific enrichment for a given trait 
in a relevant tissue [mean = 66%, 60 to 71.4% 
(95% confidence interval)] were also driving 
the enrichment in the same cell type in other 
tissues. For example, in the case of atrial fib- 
rillation and cardiac myonuclei, 26 out of 31 
myonuclei-specific genes that drove the enrich- 
ment signal in myonuclei in heart and were 
shared with at least one other tissue (Fig. 6D) 
were enriched in muscle system-related pro- 
cesses, such as muscle contraction (FDR < 0.05; 
table S17). The five myonuclei-specific genes 
that were specific to heart only (pink vertical 
lines in Fig. 6D) were instead enriched in heart 
development processes, such as cardiac muscle 
tissue development (FDR < 0.05; table S18). 
Cardiac myonuclei were also found to be the 
most relevant cell type for atrial fibrillation in 
two separate snRNA-seq studies of the human 
heart (33, 34) and based on ECLIPSER analysis 
of these two studies [fig. S31 and table S19; (28)]. 
CASQ2, which encodes a cardiac muscle mem- 
ber of the calsequestrin family, and the myosin 
heavy chain 6 and 7 genes (MYH6 and MYH7) 
were the top myonuclei-specific genes driving 
the enrichment signal for atrial fibrillation in all 
three studies (table S19). 


Associating GWAS genes with gene programs 
across cell types reveals six main trait groups 


To chart cellular programs and processes that 
may be affected by genetic variants, we asso- 
ciated a larger set of >2000 complex pheno- 
types with cell types and the covarying gene 
modules that these cell types express (28). We 
defined gene modules in the cells in our atlas 
by hierarchically clustering genes based on 
correlation across all cells, as well as within cell 
types. We then scored modules for their overlap 
with GWAS genes [defined by variant to gene 
mapping in Open Targets Genetics (153, 154)] 
that are also highly expressed by cell type 
[(28); fig. S32 and table S20]. Next, we grouped 
GWAS phenotypes into major groups by the 
similarity of their module enrichment across 
cell types (Fig. 7, A and B, and fig. S33). Finally, 
for each major group of traits, we identified the 
relevant cell types associated with the under- 
lying modules (Fig. 7, A and C) and tested the 
GWAS genes that overlapped with gene modules 
for functional enrichments (Fig. 7, A, D, and E). 
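
A toy version of the first two steps (defining covarying gene modules, then scoring them against GWAS genes) might look like the sketch below. The choices here are assumptions rather than the authors' method (28): Pearson correlation, single-linkage clustering cut at a fixed correlation of 0.8 (equivalent to taking connected components of the graph that links gene pairs at or above that correlation), and a plain overlap fraction as the module score. The expression values and the placeholder GENE_Z are invented.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation of two equal-length expression vectors."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy) if sx and sy else 0.0


def correlation_modules(expr, min_corr=0.8):
    """Single-linkage gene modules: merge genes whose correlation >= min_corr."""
    genes = list(expr)
    parent = {g: g for g in genes}

    def find(g):
        # Union-find lookup with path compression.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    for i, g in enumerate(genes):
        for h in genes[i + 1:]:
            if pearson(expr[g], expr[h]) >= min_corr:
                parent[find(g)] = find(h)

    modules = {}
    for g in genes:
        modules.setdefault(find(g), set()).add(g)
    return list(modules.values())


def module_overlap(module, gwas_genes):
    """Fraction of module genes that are also GWAS genes for a trait."""
    return len(module & gwas_genes) / len(module)


# Invented expression of four genes across five cells.
expr = {
    "ANGPTL8": [1, 2, 3, 4, 5],
    "PNPLA3": [2, 4, 6, 8, 10],
    "MYH7": [5, 1, 5, 1, 5],
    "MYH6": [4, 0, 4, 0, 4],
}
modules = correlation_modules(expr)

# Hypothetical GWAS gene set for an HDL cholesterol trait.
hdl_gwas_genes = {"ANGPTL8", "PNPLA3", "GENE_Z"}
scores = {tuple(sorted(m)): module_overlap(m, hdl_gwas_genes) for m in modules}
```

Here the lipid-related pair forms one module that fully overlaps the hypothetical trait genes, while the myosin pair forms a second module with no overlap.
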
Traits and diseases partitioned into six 
major groups, spanning immune hypersensi- 
tivity, cardiovascular, calcium channel-related, 
cognitive and psychiatric, pigmentation, and 
HDL cholesterol-related based on their asso- 
ciations with cell types (Fig. 7B). The immune 
hypersensitivity disorders were associated 
with T cells, including the expected relation between 
lung T cells and hay fever, allergic rhinitis, 
and respiratory disease [(159, 160); Fig. 7C]. 
GWAS genes in the modules associated with 
these traits were enriched for lymphocyte activation 
and differentiation and T cell receptor 
signaling and cell-cell adhesion (Fig. 7D), 
with interleukin-35 (IL-35) signaling genes enriched 
in hypothyroidism and inflammatory 
bowel disease (IBD), consistent with IL-35 up-regulation 
in Hashimoto’s thyroiditis (161) and 
IBD (162) and down-regulation in Graves’ disease 
(163). The cardiovascular traits group was 
associated with pericytes and smooth muscle 
cells and enriched with blood circulation, 
smooth muscle contraction, muscle structure 
development, and cardiocyte differentiation 
genes. The cardiovascular group overlapped 
with the calcium channel-related group [which 
included blood pressure medication, pulse 
rate, medication use of calcium-channel blockers, 
and vascular system traits, as well as 
schizophrenia and autism spectrum disorder, 
which are psychiatric disorders with known 
calcium channel associations (164)] and was 
enriched with membrane depolarization and 
calcium ion channel genes (Fig. 7D). Other 
cognitive and psychiatric phenotypes grouped 
separately and were enriched with neuronal 
synapse organization, structure, and activity genes 
(Fig. 7D). Finally, a group of HDL cholesterol 
traits was associated with adipocytes in all 
three muscle tissues and enriched for fatty 
acid, triglyceride, and lipid homeostasis and 
related metabolic processes (monocarboxylic 
and glycerol metabolism), including ANGPTL8 
and PNPLA3 (Fig. 7E and table S20), which 
are genes involved in lipolysis regulation in 
adipocytes that might affect extrahepatic choles- 
terol transport via HDLs (165, 166). 


Toward large-scale snRNA-seq of human 
tissues with pooling 


To enable future studies at the population 
scale, we tested whether frozen samples from 
different individuals can be pooled for snRNA- 
seq, followed by computational demultiplex- 
ing, as a cost-effective and scalable approach 
(167), as previously applied to large-scale studies 
of human peripheral blood mononuclear cells 
(24). We processed lung or prostate samples 
jointly from three individuals using the CST 
and TST protocols. We pooled tissue samples 
from three individuals and processed the pool 
for single-nucleus extraction, thus minimizing 
technical batch effects and wet-lab time. After 
sequencing, we removed ambient RNA (30) 
and performed de novo genotype-based de- 
multiplexing to assign nuclei to donors [using 
souporcell (168); fig. S34; (28)]. We validated 
the demultiplexing by comparing the genotype- 
based assignments to those from an expression- 
based multinomial logistic classifier that assigned 
donor identity to each nucleus profile after 
training with unpooled samples of the same 
donors, which showed high concordance 
Fig. 6. Cell type-specific enrichment of eQTL and sQTL target genes mapped
to GWAS loci. (A) Schematic of the method (ECLIPSER). (B) Cell-type enrichment
of genes mapped to GWAS loci for 17 of the 21 complex traits tested with at
least one tissue-wide significant result (FDR < 0.05, correcting for all cell types tested
per tissue per trait) across eight GTEx tissues. Gray, orange, and red borders indicate
nominal, tissue-wide, and experiment-wide significance (FDR < 0.05, correcting for
all cell types tested across eight tissues and 21 traits), respectively. Only cell types with
at least one tissue-wide enrichment are shown. (C and D) Myonuclei and pericyte genes
enriched in atrial fibrillation GWAS loci (tissue-wide FDR < 0.05, Bayesian Fisher's
exact test). Fold-enrichment (x axis) of cell types (y axis) for atrial fibrillation GWAS in
heart (top) and skeletal muscle (bottom) is shown in (C). Error bars represent 95%
credible intervals. Red indicates tissue-wide significance, orange indicates nominal
significance, and blue indicates nonsignificance (P ≥ 0.05, Bayesian Fisher's exact test).
Differential expression in myonuclei versus other cell types from heart (red), skeletal
muscle (blue), esophagus muscularis (orange), and prostate (brown) of the genes
(x axis) driving enrichment of atrial fibrillation GWAS loci in heart cardiac myonuclei is
shown in (D). Gray and pink vertical lines indicate log2(fold change) > 0.5 and FDR < 0.1
in myonuclei in all four tissues or only in heart, respectively. FC, fold change.
Fig. 7. Cell types and gene modules relevant for trait and disease groups by
GWAS module enrichment. (A) Schematic of the module-based enrichment method.
Shaded edges indicate associations between cell types and phenotypes through
modules (middle). (B to E) Trait and disease groups identified by GWAS-cell type
relationships. Similarity (Spearman correlation coefficient) of traits and
diseases (rows and columns) by enriched cell types is shown in (B). Dashed lines
demarcate trait and disease groups. Shown in (C) is the cell-type enrichment for each
of the GWAS-enriched modules. Significance [-log10(FDR


between the two approaches (accuracy 88 to 
96%; fig. S34). Moreover, genotype-based 
doublet calls were concordant with expression- 
based doublet calls in both lung and prostate 
(mean balanced accuracy of 63%) (fig. S34). 
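
The concordance checks described above reduce to two simple metrics computed over per-nucleus labels. The sketch below is illustrative only: the donor names and label lists are hypothetical toy values, not data from this study, and it does not reproduce souporcell or the expression-based classifier. It assumes the genotype-based calls are treated as the reference and shows how plain accuracy and balanced accuracy (the mean of per-class recall) could be computed.

```python
from collections import defaultdict


def accuracy(reference, predicted):
    """Fraction of nuclei on which the two assignments agree."""
    return sum(r == p for r, p in zip(reference, predicted)) / len(reference)


def balanced_accuracy(reference, predicted):
    """Mean per-class recall, using `reference` as the ground truth."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for r, p in zip(reference, predicted):
        totals[r] += 1
        hits[r] += int(r == p)
    return sum(hits[c] / totals[c] for c in totals) / len(totals)


# Hypothetical per-nucleus donor assignments from the two approaches.
genotype_donor = ["d1", "d1", "d2", "d2", "d3", "d3", "d3", "d1"]
expression_donor = ["d1", "d1", "d2", "d3", "d3", "d3", "d3", "d1"]

# Hypothetical per-nucleus doublet calls (True = doublet).
genotype_doublet = [False, False, True, False, False, True, False, False]
expression_doublet = [False, False, True, False, False, False, False, False]

donor_concordance = accuracy(genotype_donor, expression_donor)
doublet_balanced_acc = balanced_accuracy(genotype_doublet, expression_doublet)

print(f"donor concordance: {donor_concordance:.3f}")
print(f"doublet balanced accuracy: {doublet_balanced_acc:.3f}")
```

Balanced accuracy is the more informative of the two for doublet calls because doublets are usually a small minority of nuclei, so plain accuracy can look high even when few doublets are recovered.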


Discussion 


Cross-tissue atlases allow us to characterize
tissue-specific and tissue-agnostic features of
cells of a common type that serve accessory
roles in tissues, such as immune and stroma
cells (fig. S35A). For example, for LYVE1- and
HLAII-expressing macrophages, our results
reinforce the notion of functional specification
of these two macrophage states into tissue
support and tissue immunity, respectively (47),
and propose a model for their differentiation
(fig. S35, B and C). In mice, Lyve1^hi macrophages
are localized perivascularly, whereas
MHCII^hi macrophages are found in proximity
to neurons (47). Future studies can address
this localization in humans, and signals that
govern the tissue-specific ratios of LYVE1- versus
HLAII-expressing macrophages.
(circle color) of enrichment of gene sets (columns) with the genes in the intersection
of a gene module and GWAS genes for each trait or disease in (B) (rows) are 
shown in (D). A red outline indicates an FDR less than 0.1. Shown in (E) is the number 
of traits (x axis) in each module in (B) where a gene (y axis) is detected as the 
driver of the association in the intersection of the gene modules, a trait or disease 
enriched in the module, and a functional gene set for the top 10 most frequently 
identified genes in the enrichment analysis of each module. EA, East Asian; SCZ, 
Schizophrenia; UKBB, UK Biobank. 


Our data demonstrate the prevalence of 
LAM-like cells across tissue contexts and 
pathologies, including breast and heart, where 
we recovered both adipocytes and LAM-like 
cells (Figs. 1 and 3C and fig. S2). In line with a 
model of lipid-induced differentiation of macro- 
phages toward the LAM state (85), our classi- 
fier recovered LAMs in pathologies characterized 
by lipid accumulation: atherosclerosis, Crohn’s 
disease, and acne. The inferred role for PPARG
and NR1H3 in driving the LAM expression program
suggests a model in which signaling




through lipid-bound receptors on macrophages, 
such as TREM2 or CD36, up-regulates the ex- 
pression of more lipid receptors, as well as of 
lipid-modifying enzymes through PPARG and 
NR1H3. Because we observed LAMs in healthy 
organs and in conditions linked to lipid accu- 
mulation, future studies may identify other 
tissue-specific signals or conditions that can 
trigger LAM-like states. 

In lung alveolar fibroblasts, we identified an 
interconnected functional module that may 
enable an adequate response to alveolar dis- 
tortion, through transduction of mechanical 
cues from the ECM to cytosolic calcium- 
induced cytoskeletal contraction (fig. S35, D 
and E). These comprise BM adhesion genes,
including ITGA8 and its interacting partner
NPNT (95-100); a calcium transport module 
featuring neurotransmitter receptors and 
multiple calcium ion channels, including the 
mechanosensitive channel PIEZO2; and an 
actomyosin contractility program that fea- 
tures MYLK and mediates cytosolic calcium- 
induced cytoskeletal contraction, which is 
required for the transduction of mechanical 
cues from the ECM (169). PIEZO2 and ITGA8 
are also coexpressed in intraglomerular mesan- 
gial cells (170, 171), where PIEZO2 may sense 
mechanical forces resulting from changes in 
blood flow and ITGA8 confers contractility
and adhesion (172). Because the alveolus and 
glomerulus are stretch sensors of air pressure 
and blood pressure, respectively, PIEZO2+/ITGA8+
alveolar fibroblasts may contribute to
mechanosensing of alveolar tension, which so
far has been mainly attributed to Piezo2+ sensory
neurons in mice (777), suggesting conserved
features in mechanosensing across organs. Both 
lung alveolar fibroblasts and myofibroblasts 
have been localized to alveoli (40), suggesting
a relationship, possibly through FGFR4
in alveolar fibroblasts and its ligand FGF18 in
myofibroblasts (fig. S18D) (109, 173).

We further demonstrated the utility of our 
tissue atlas for monogenic and polygenic dis- 
ease biology. Many monogenic disease gene 
modules that are defined by comorbidity [e.g., 
diabetes and lipodystrophy (74)] or similarity 
of clinical phenotypes (e.g., muscle diseases) 
were enriched in expected cell types (fig. S35F). 
Focusing on the pathobiology of monogenic 
muscle diseases (fig. S35G), we highlighted 
nonmyocyte cell populations with a potential 
role in muscle diseases, including nervous sys- 
tem, immune, and stromal cells (778), as well 
as specific myonuclei subsets that express 
muscle disease genes. Whether the multiple 
myonuclei subsets we observed are related to 
multinucleation and specialization of different 
nuclei in one syncytium (37) remains unclear. 
Some disease-risk genes may also disrupt cell- 
cell interactions in the muscle. Notably, we 
observed varying levels of DMD expression 
across cell types, as well as between human 
and mouse. Variation in DMD isoform expression,
which has critical implications in Duchenne
muscular dystrophy (135), can be investigated 
in future studies. For common complex dis- 
eases, we found significant enrichment in spe- 
cific cell groups for multiple traits (fig. S35H). 
For more than half of the traits, there was 
enrichment for the same cell type in different 
tissues driven by both common and tissue- 
specific genes. Future work will be needed to 
extend these analyses across a broad set of 
tissues and cell types and examine the role of 
disease-associated genes and cell types in a dis- 
ease context in patient samples. 

Advances in single-cell epigenomics (175) and
multi-omics (176-178) should further enable 
linking GWAS variants to their target genes 
and the cell types and programs in which they 
act. Recent findings indicating that a large 
fraction of genetic regulatory effects linked 
to GWAS variants can only be detected at the 
cellular level (77, 25) suggest that cell-level 
eQTL maps will be essential. The experimen- 
tal and computational methods we developed 
for a cross-tissue atlas, and the biological 
queries we defined, will provide a basis for 
scaling such efforts to hundreds of individuals 
and diverse populations. 


Methods summary 


Tissue samples were selected from among a 
subset of GTEx project samples that were flash 
frozen and banked. Nuclei from each sample 
were extracted using the EZ, CST, NST, and 
TST protocols described in (26). Libraries for 
snRNA-seq were generated using the Chro- 
mium Single Cell 3’ v2 Reagent Kit (10x Geno- 
mics), and sequencing was performed with 
Illumina HiSeq X (96 samples) or NextSeq 
(three samples), according to the manufacturer’s 
protocols. The resulting snRNA-seq data were 
aligned and quantified using CellRanger v2.1.0 
(10x Genomics), ambient RNA correction was 
performed using CellBender (30), and low- 
quality nuclei were filtered out using standard 
criteria (28). The resulting snRNA-seq expres- 
sion profiles were integrated across samples 
using a total correlation variational autoen- 
coder (28). Detailed descriptions of all compu- 
tational analyses are provided in (28). 
INTRODUCTION: Immune cells that seed peripheral
tissues play a vital role in health and disease,
yet most studies of human immunity
focus on blood-derived cells. Immune cells 
adapt to local microenvironments, acquiring 
distinct features and functional specializa- 
tion. Dissecting these molecular adaptations 
through the systematic assessment of cells 
across the human body promises to transform 
our understanding of the immune system at 
the organismal level. 


RATIONALE: To comprehensively assess immune 
cell types, we collected donor-matched tissues 
from 12 deceased organ donors. We isolated im- 
mune cells and performed single-cell RNA se- 
quencing and paired VDJ sequencing for T cell 
and B cell receptors, resulting in high-quality 
data for ~330,000 immune cells. To resolve 
cell identities, we developed CellTypist, a logis- 
tic regression-based framework using stochas- 
tic gradient descent learning. This cross-tissue 
annotation enabled interrogation of shared and
tissue-specific expression modules and cell states
within myeloid and lymphoid cell lineages.


RESULTS: We developed CellTypist by curating 
and harmonizing public datasets to assemble 
a comprehensive immune cell type reference 
database (https://www.celltypist.org). CellTypist 
was then applied to our data, generated across 
multiple tissues and individuals
(https://www.tissueimmunecellatlas.org/). Altogether, we
detected 101 immune populations and per- 
formed extensive cross-tissue comparisons 
for each subset. Although macrophages dis- 
played prominent tissue-restricted features, 
some convergent features were also detected. 
For example, macrophages expressing 
erythrophagocytosis-related genes were widely 
found in spleen, liver, bone marrow, and 
lymph nodes. Heterogeneity within defined 
subpopulations was also observed, such as 
migratory dendritic cell adaptations. Within 
adaptive immune lineages, we identified 
tissue-specific distributions of memory populations.
Plasma cells showed a restricted tissue
distribution, whereas memory B cells were more
widely distributed. Similarly, tissue-resident
memory T (TRM) cells were more restricted in
distribution compared with central and effector
memory T cells. Notably, TRM cells harbored
significant diversity, including αβ and γδ lineages,
ascertained by VDJ sequencing. Assessment
of clonal dynamics revealed the highest
clonal expansions in TRM cells and the most
frequent clonal sharing between resident and
effector memory populations.


CONCLUSION: Here we present an immune cell
atlas of myeloid and lymphoid lineages across
adult human tissues. We developed CellTypist
for automated immune cell annotation and
performed an in-depth dissection of cell populations,
identifying 101 cell types or states from
more than one million cells, including previously
underappreciated cell states. We also uncovered
convergent phenotypes across tissues
within given lineages and described tissue adaptation
signatures for a number of cell types,
including macrophages and resident memory
T cells. Together, we have extended our understanding
of how human immunity functions
as an integrated, cross-tissue network, and we
provide the scientific community with several
key new resources.


Cross-tissue atlas of human immune cells. Donor-matched tissues from
12 deceased organ donors were profiled using single-cell RNA sequencing [including
single-cell VDJ sequencing (scVDJ-seq)]. Cell type annotation was achieved with
CellTypist, an automated cell type annotation pipeline, followed by manual curation.
In-depth analysis of transcriptional features revealed convergent and divergent gene
expression programs for each cell lineage across lymphoid and nonlymphoid
tissues. scVDJ-seq enabled repertoire analysis across cell subsets and tissues.
scRNA-seq, single-cell RNA sequencing; TCR, T cell receptor; BCR, B cell receptor.
Despite their crucial role in health and disease, our knowledge of immune cells within human tissues
remains limited. We surveyed the immune compartment of 16 tissues from 12 adult donors by single-cell 
RNA sequencing and VDJ sequencing generating a dataset of ~360,000 cells. To systematically resolve 
immune cell heterogeneity across tissues, we developed CellTypist, a machine learning tool for rapid and 
precise cell type annotation. Using this approach, combined with detailed curation, we determined the 
tissue distribution of finely phenotyped immune cell types, revealing hitherto unappreciated tissue-specific 
features and clonal architecture of T and B cells. Our multitissue approach lays the foundation for 
identifying highly resolved immune cell types by leveraging a common reference dataset, tissue-integrated 
expression analysis, and antigen receptor sequencing. 


The immune system is a dynamic and
integrated network made up of many 
different cell types distributed across 
the body that act together to maintain 
tissue homeostasis and mediate pro- 
tective immunity. In recent years, a grow- 
ing appreciation of immune ontogeny and 
diversity across tissues has emerged. For 
example, we have gained insights into how 
macrophages derived in embryogenesis con- 
tribute to the distinctive adaptation of tissue- 
resident myeloid cells, such as Langerhans 
cells in the skin, microglia in the brain, and 
Kupffer cells in the liver (1-3). Other populations,
such as innate lymphoid cells (ILCs),
including natural killer (NK) cells and nonconventional
T cells [natural killer T (NKT)
cells, mucosal-associated invariant T (MAIT)
cells, and gamma-delta (γδ) T cells], have
circulating counterparts but are highly en- 
riched at barrier sites where they mediate 
tissue defense and repair (4). In addition, 
following resolution of an immune response, 
antigen-specific, long-lived tissue-resident 
memory T (TRM) cells persist in diverse sites,
where they provide optimal protection against 
secondary infections [reviewed in (5-7)]. 
Studies in mice have revealed the central 
role of tissue immune responses in protective 
immunity, antitumor immunity, and tissue re- 
pair; however, human studies have largely 
focused on peripheral blood as the most ac- 
cessible site. Given that circulating immune 
cells make up only a subset of the entire 
immune cell landscape, understanding human 
immunity requires a comprehensive assess- 
ment of the properties and features of immune 
cells within and across tissues. Recent progress 
in the analysis of tissue immune cells has im- 
plemented organ-focused approaches (8-12), 
whereas use of tissues obtained from organ 
donors allows for analysis of immune cells 
across multiple sites within an individual
(13-19). We previously reported single-cell 
RNA sequencing (scRNA-seq) analysis of 
T cells in three tissues from two donors (20), 
identifying tissue-specific signatures. How- 
ever, despite the effort to assemble murine 
(21) and human (22, 23) multitissue atlases, 
large-scale cross-tissue scRNA-seq studies that 
investigate tissue-specific features of innate 
and adaptive immune compartments have not
been reported. 

Furthermore, a fundamental challenge of 
increasingly large single-cell transcriptomics 
datasets is cell type annotation, including iden- 
tifying rare cell subsets and distinguishing 
novel discoveries from previously described 
cell populations. Currently available automated 
annotation workflows leverage organ-focused 
studies and lack a comprehensive catalog of all 
cell types found across tissues. Therefore, a 
single unified approach is required to provide 
an in-depth dissection of immune cell type and 
immune state heterogeneity across tissues. 

To address these needs, we comprehensively 
profiled immune cell populations isolated from 
a wide range of donor-matched tissues from 
12 deceased individuals, generating nearly 
360,000 single-cell transcriptomes. To sys- 
tematically annotate multitissue immune cells, 
we developed CellTypist, a machine learning 
framework for cell type prediction initially 
trained on data from studies across 20 human 
tissues (see supplementary text in the supple- 
mentary materials) and then updated and 
extended by integrating immune cells from 
our dataset. 


Results 
CellTypist: A pan-tissue immune database and a 
tool for automated cell type annotation 


To systematically assess immune cell type het- 
erogeneity across human tissues, we performed 
scRNA-seq on 16 different tissues from 12 de- 
ceased organ donors (Fig. 1, A and B, and 
table S1). The tissues studied included pri- 
mary (bone marrow) and secondary (spleen 
and lung-draining and mesenteric lymph 
nodes) lymphoid organs, mucosal tissues (gut 
and lung), as well as blood and liver. When 
available, we also included additional donor- 
matched samples from tissues such as thy- 
mus, skeletal muscle, and omentum. Immune 
cells were isolated from tissues as detailed in 
the methods. After stringent quality control, 
we obtained a total of 357,211 high-quality 
cells, of which 329,762 belonged to the im- 
mune compartment (fig. S1, A and B). 
Robust cell type annotation remains a major 
challenge in single-cell transcriptomics. To 
computationally predict cellular heterogeneity 
in our dataset, we developed CellTypist (24), a 
cell type database, which in its current form is 
focused on immune cells, that provides a di- 
rectly interpretable pipeline for the automatic 
annotation of scRNA-seq data (Fig. 1C). 
One of the distinctive and valuable aspects 
of CellTypist is that its training set encom- 
passes a wide range of immune cell types 
across tissues. This breadth is of critical im- 
portance given that immune compartments 
are shared across tissues, warranting accurate 
and automated cell annotation in an organ- 
agnostic manner. In brief, to develop CellTypist, 
we integrated cells from 20 different tissues
from 19 reference datasets (fig. S2) where
we had deeply curated and harmonized cell
types at two knowledge-driven hierarchical
levels (figs. S3 to S8). This was followed by a
machine learning framework to train a model
using logistic regression with stochastic gradient
descent learning (see methods). Performance
of the derived models, as measured
by the precision, recall, and global F1 score,
reached ~0.9 for cell type classification at
both the high- and low-hierarchy levels (Fig. 1C
and fig. S9, A and B). Notably, representation
of a given cell type in the training data was a
major determinant of its prediction accuracy
(fig. S9C), implying that higher model performance
can be achieved by incorporating
additional datasets. Moreover, CellTypist prediction
was robust to differences between
training and query datasets including gene
expression sparseness (fig. S10) and batch
effects (fig. S11).

Fig. 1. Automated annotation of immune cells across human tissues using
CellTypist. (A) Schematic showing sample collections of human lymphoid and
nonlymphoid tissues and their assigned tissue name acronyms. (B) Schematic of
single-cell transcriptome profiling and paired sequencing of αβ TCR, γδ TCR,
and BCR variable regions. (C) Workflow of CellTypist including data collection,
processing, model training, and cell type prediction (upper panel). Performance
curves showing the F1 score at each iteration of training with mini-batch
stochastic gradient descent for high- and low-hierarchy CellTypist models,
respectively (lower panel). The black curve represents the median F1 score
averaged across the individual F1 scores of all predicted cell types. (D) Uniform
manifold approximation and projection (UMAP) visualization of the immune
cell compartment colored by tissues. Note that jejunum samples in (A) were
further split into epithelial (JEJEPI) and lamina propria (JEJLP) fractions.
(E) As in (D), but colored by predicted cell types using CellTypist.

Next, we applied CellTypist’s high-hierarchy 
(i.e., low-resolution, 32 cell types) classifier to 
our cross-tissue dataset (Fig. 1D) and found 15 
major cell populations (fig. S1C). At this level
of resolution, clear compositional patterns 
emerged in lymphoid versus nonlymphoid 
tissues, and within the lymphoid tissues be- 
tween spleen and lymph nodes, and appeared 
not to be driven by differences in dissociation 
protocols (fig. S12). As the training datasets 
of CellTypist contained hematopoietic tissues 
with definitive annotations for progenitor 
populations, the classifier was able to resolve 
several progenitors including hematopoietic 
stem cells (HSCs), multipotent progenitors 
(MPPs), promyelocytes, early megakaryo- 
cytes, and pre-B and pro-B cells. Furthermore, 


CellTypist clearly resolved monocytes and 
macrophages, which often form a transcriptomic
continuum in scRNA-seq datasets owing
to their functional plasticity. Thus, CellTypist 
was successfully able to identify major groups 
of cell populations with different abundances 
in our dataset (fig. S1C). 
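
To make the classifier idea concrete, the following is a minimal sketch of multinomial logistic regression trained with mini-batch stochastic gradient descent, the general technique named for CellTypist. It is not the CellTypist implementation: the function names, hyperparameters, three-gene toy expression matrix, and cell-type labels are all assumptions chosen for illustration, and a real workflow would operate on thousands of genes with a dedicated library.

```python
import math
import random


def softmax(scores):
    """Convert raw class scores into probabilities (numerically stable)."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def class_scores(W, b, x):
    """Linear score of each class for one cell's expression vector."""
    return [sum(w * xi for w, xi in zip(Wk, x)) + bk for Wk, bk in zip(W, b)]


def train_sgd(X, y, n_classes, epochs=200, batch_size=4, lr=0.1, seed=0):
    """Fit multinomial logistic regression with mini-batch SGD."""
    rng = random.Random(seed)
    n_features = len(X[0])
    W = [[0.0] * n_features for _ in range(n_classes)]
    b = [0.0] * n_classes
    order = list(range(len(X)))
    for _ in range(epochs):
        rng.shuffle(order)  # new random mini-batches every epoch
        for start in range(0, len(order), batch_size):
            batch = order[start:start + batch_size]
            grad_W = [[0.0] * n_features for _ in range(n_classes)]
            grad_b = [0.0] * n_classes
            for i in batch:
                probs = softmax(class_scores(W, b, X[i]))
                for k in range(n_classes):
                    # Gradient of the cross-entropy loss w.r.t. score k.
                    err = probs[k] - (1.0 if y[i] == k else 0.0)
                    grad_b[k] += err
                    for j in range(n_features):
                        grad_W[k][j] += err * X[i][j]
            for k in range(n_classes):
                b[k] -= lr * grad_b[k] / len(batch)
                for j in range(n_features):
                    W[k][j] -= lr * grad_W[k][j] / len(batch)
    return W, b


def predict(W, b, x):
    """Return the index of the highest-scoring class."""
    scores = class_scores(W, b, x)
    return max(range(len(scores)), key=scores.__getitem__)


# Hypothetical toy data: 12 cells x 3 marker genes, 3 made-up cell types.
X = [
    [3, 0, 0], [2, 1, 0], [3, 0, 1], [2, 0, 0],   # type 0
    [0, 3, 0], [1, 2, 0], [0, 3, 1], [0, 2, 0],   # type 1
    [0, 0, 3], [0, 1, 2], [1, 0, 3], [0, 0, 2],   # type 2
]
y = [0, 0, 0, 0, 1, 1, 1, 1, 2, 2, 2, 2]

W, b = train_sgd(X, y, n_classes=3)
predictions = [predict(W, b, x) for x in X]
```

Because each update touches only a small batch of cells, this style of training scales to large reference datasets, and the learned per-class gene weights remain directly inspectable, which is one plausible reading of why a linear model is described as interpretable.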

To automatically annotate fine-grained im- 
mune subpopulations, we next applied the 
low-hierarchy (high-resolution, 91 cell types 
and subtypes) classifier, which was able to 
classify cells into 43 specific subtypes, includ- 
ing subsets of T cells, B cells, ILCs, and mono- 
nuclear phagocytes (Fig. 1E). This revealed a 
high degree of heterogeneity within the T cell 
compartment, not only distinguishing between 
αβ and γδ T cells but also between CD4+ and
CD8+ T cell subtypes and their more detailed
effector and functional phenotypes. Specifically,
the CD4+ T cell cluster was classified
into helper, regulatory, and cytotoxic subsets, 
and the CD8+ T cell clusters contained unconventional
T cell subpopulations such as
MAIT cells. In the B cell compartment, a clear
distinction was observed between naive and
memory B cells. Moreover, CellTypist revealed
three distinct subsets of dendritic cells
(DCs): DC1, DC2, and migratory DCs (migDCs)
(25, 26), again highlighting the granularity
that CellTypist can achieve. This fine-grained
dissection of each compartment also allowed
for the cross-validation and consolidation of
cell types by examining the expression of
marker genes derived from CellTypist models
in cells from our dataset (fig. S1D).

To summarize, we generated an in-depth
map of immune cell populations across human
tissues and developed a framework for
automated annotation of immune cell types
and subtypes. CellTypist produced fine-grained
annotations on our multitissue and multilineage
dataset, and its performance, as assessed
on multiple metrics, was comparable
to or better than other label-transfer methods,
with minimal computational cost (figs.
S13 and S14). This approach allowed us to
identify fine-grained cell subtypes, such as the
progenitors and dendritic cell compartments
at full transcriptomic breadth, resolving 43 cell
states in total across our dataset. This automated
annotation forms the basis for further
cross-tissue comparisons of cell compartments
in the sections below.


Tissue-restricted features of mononuclear
phagocytes

Mononuclear phagocytes, including monocytes,
macrophages, and dendritic cells, are critical for
immune surveillance and tissue homeostasis.
Automatic annotation by CellTypist identified
eight populations in this compartment
(fig. S15A). To explore macrophage heterogeneity
further, we built upon CellTypist’s
annotation by performing additional manual
curation, which revealed further heterogeneity
within the macrophages (Fig. 2A and
fig. S15B). The identities of these cells were
supported by expression of well-established
marker genes (Fig. 2B) and by markers independently
identified from CellTypist models
(fig. S15C). Moreover, existence of these cell
types was cross-validated, and thus consolidated,
in the training datasets of CellTypist (fig. S15D),
as well as in myeloid cells from two additional
studies of the human intestinal tract (27) (fig.
S15E) and lung (28) (fig. S15F).

Fig. 2. Myeloid compartment across tissues. (A) UMAP visualization of the
cell populations in the myeloid compartment. (B) Dot plot for expression
of marker genes of the identified myeloid populations. Color represents
maximum-normalized mean expression of cells expressing marker genes, and
size represents the percentage of cells expressing these genes. (C) UMAP
visualization of the tissue distribution in the myeloid compartment. (D) Heatmap
showing the distribution of each myeloid cell population across different
tissues. Cell numbers are normalized within each tissue and later calculated as
proportions across tissues. Only tissues containing >50 myeloid cells in at
least two donors were included. Asterisks mark significant enrichment in
a given tissue relative to the remaining tissues [Poisson regression stratified by
donors, P < 0.05 after Benjamini-Hochberg (BH) correction]. (E) Violin plot
for genes differentially expressed in migratory dendritic cells across tissues.
Color represents maximum-normalized mean expression of cells expressing
marker genes. (F) smFISH visualization of ITGAX, CCR7, and AIRE transcripts,
validating the AIRE+ migratory dendritic cells in lung-draining lymph nodes.

Among macrophages, lung-resident cells
constituted the majority and were classified
into two major clusters: (i) alveolar macrophages
expressing GPNMB and TREM2,
markers that have been related to alveolar
macrophages (29) and disease-associated
macrophages (30), respectively; and (ii) intermediate
macrophages with distinctive
expression of TNIP3 (Fig. 2, B to D). TNIP3
(TNFAIP3-interacting protein 3) binds to A20,
encoded by TNFAIP3, and inhibits tumor
necrosis factor, interleukin-1 (IL-1), and
lipopolysaccharide-induced nuclear factor
κB (NF-κB) activation (37). Its expression in
lung macrophages may be related to underlying
pathology, as it was primarily detected in
one donor (A29), a multitrauma donor with
substantial lung contusions. Notably, this
population also expressed the monocyte-recruiting
chemokine CCL2 (Fig. 2B), providing
a means of replenishing the lung
macrophage pool.

Fig. 3. B cell compartment across tissues. (A) UMAP visualization of the
cell populations in the B cell compartment. (B) Dot plot for expression of
marker genes of the identified B cell populations. Color represents maximum-
normalized mean expression of cells expressing marker genes, and size
represents the percentage of cells expressing these genes. (C) UMAP
visualization of the tissue distribution in the B cell compartment.
(D) Heatmap showing the distribution of each B cell population across
different tissues. Cell numbers are normalized within each tissue and later
calculated as proportions across tissues. Only tissues containing >50 B cells
in at least two donors were included. Asterisks mark significant enrichment in
a given tissue relative to the remaining tissues (Poisson regression
stratified by donors, P < 0.05 after BH correction). (E) Stacked bar plots showing
the isotype distribution per tissue within memory B cells and the plasma
cells. (F) Violin plot of the hypermutation frequency on the IgH chain across
isotypes. Significant difference among IgG4, IgG2, and IgG1 and between
IgA2 and IgA1 is marked by asterisks (Wilcoxon rank sum test, P < 0.05).
(G) Scatterpie plot showing the tissue distribution and B cell subsets of
expanded clonotypes (>10 cells). Each vertical line represents one clonotype.

Other macrophage subsets in our data also
showed a high degree of tissue restriction
(Fig. 2D). Erythrophagocytic macrophages,
including red pulp macrophages and Kupffer
cells, mainly populated the spleen and liver,
as expected, and shared high expression of
CD5L, SLC40A1, and the transcription factor
SPIC (32). Notably, a number of macrophages
from lymph nodes clustered together with
erythrophagocytic macrophages, pointing
to the presence of a small number of iron-recycling
macrophages in lymph nodes (Fig.
2D). Another macrophage population specifically
present in the gut expressed CD209
(encoding DC-SIGN) and IGF1, markers that
have been previously reported in mature intestinal
macrophages and M2-like macrophages,


respectively (33, 34). Lastly, monocytes were 
clearly grouped in two major subgroups, classi- 
cal and nonclassical monocytes, which differed 
in the expression of CD14, FCGR3A, and CX3CR1
and in their tissue distribution (Fig. 2, A to D).
Among dendritic cells, DC1 expressed XCR1
and CLEC9A, consistent with their identity as
conventional DCs (DC1), specialized for cross-
presentation of antigens (Fig. 2B). Conventional
DC2s expressed CD1C and CLEC10A, and migDCs
were CCR7+LAMP3+. CCR7 is up-regulated in
tissue DCs following Toll-like receptor or
Fc-gamma receptor ligation (35, 36), enabling
migration toward CCL19/21-expressing lym- 
phatic endothelium and stromal cells in the 
T cell zone of lymph nodes (37, 38). Consistent 
with this, we observed a marked enrichment of 
CCR7+ migDCs in lung-draining and mesen-
teric lymph nodes (Fig. 2D). Notably, migDCs 
showed up-regulation of AIRE, PDLIM4, and
EBI3 in lymph nodes (Fig. 2E). Extrathymic
expression of the autoimmune regulator AIRE
has been reported in humans (39, 40); however,
its functional role in secondary lymphoid 
organs remains poorly understood and is a 
matter of intense research (41-43). We vali- 
dated the presence of migDCs in lung-draining 
lymph nodes by immunofluorescence (fig. SI6A) 
and AJRE expression by single-molecule fluores- 
cence in situ hybridization (smFISH) (Fig. 2F). 
In addition, another subpopulation of migDCs 
found in lung and lung-draining lymph nodes 
up-regulated CRLF2 [encoding thymic stromal 
lymphopoietin receptor (TSLPR)], chemokines 
(CCL22 and CCL17), CSF2RA, and GPR157 
(Fig. 2E). TSLPR is involved in the induction 
of T helper 2 (T};2) cell responses in asthma 
(44). Expression of these genes in lung DCs 
was also observed in published scRNA-seq 
datasets (45, 46) (fig. S16, B and C). These 
observations suggest that dendritic cell acti- 
vation coincides with the acquisition of tissue- 
specific markers that differ depending on the 
local microenvironment. 

Overall, our analysis of the myeloid com- 
partment has revealed shared and tissue- 
restricted features of mononuclear phagocytes, 
including alveolar macrophages in the lung, 
iron-recycling macrophages mostly localized 
in the spleen, and subtypes of migratory den- 
dritic cells. 


B cell subsets and immunoglobulin repertoires 
across tissues 


B cells comprise progenitors in the bone mar- 
row, developmental states in lymphoid tissues, 
and terminally differentiated memory and 
plasma cell states in lymphoid and nonlym- 
phoid tissues. They play a central role in 
humoral immunity via the production of 
antibodies tailored to specific body sites. We 
first annotated the B cells using CellTypist 
and obtained six populations (fig. S17A). 
Manual curation revealed further hetero- 
geneity in memory B cells and plasma cells, 
identifying 11 cell types in total (Fig. 3A and 
fig. S17B). As with the myeloid compartment, 
we cross-checked and verified these cell types 
in CellTypist training datasets (fig. S17, C and 
D) and in two independent immune datasets 
from gut (27) and lung (28) (fig. S17, E and F). 

Naive B cells expressed TCL1A and were
primarily found in lymphoid tissues (Fig. 3, 
B to D). In addition, we identified two populations
of germinal center B cells, expressing
AICDA and BCL6, that differed in their
proliferative states (marked by MKI67). Of 
note, we did not find differential expression 
of dark zone and light zone marker genes in 
these two populations, probably reflecting 
limited germinal center activity in our adult 
donors. Moreover, these germinal center pop- 
ulations were present in lymph nodes and 
diverse gut regions (Fig. 3, C and D), pre- 
sumably representing Peyer’s patches. Within 
memory B cells, which were characterized 
by expression of the B cell lineage markers 
(MS4A1 and CD19) and TNFRSF13B, we found
a transcriptomically distinct cluster positive 
for ITGAX, TBX21, and FCRL2, encoding CD11c, 
T-bet, and the Fc receptor-like protein 2, respectively
(Fig. 3B). CD11c+T-bet+ B cells, also
known as age-associated B cells (ABCs), have
been reported in autoimmunity and aging
(47-49) and likely correspond to this ITGAX+
memory B cell population. Unlike conven- 
tional memory B cells, ABCs showed relatively 
low expression of CR2 (encoding CD21) and 
CD27 (Fig. 3B). This subset was mainly pre- 
sent in the liver and bone marrow, while in 
secondary lymphoid organs, it was primarily 
found in the spleen, as confirmed by flow 
cytometry and immunofluorescence (Fig. 3, C 
and D, and fig. S18). These data deepen our
understanding of the phenotype of this non- 
classical subtype of memory B cells and their 
tissue distribution. 

We uncovered two populations of plasma- 
blasts and plasma cells marked by expression 
of CD38, XBP1, and SDC1. Whereas the former
expressed MKI67 and were found in the spleen,
liver, bone marrow, and blood, the latter ex- 
pressed the integrin alpha-8-encoding gene 
ITGA8 and the adhesion molecule CERCAM 
and were enriched in the jejunum and liver 
(Fig. 3, B to D). ITGA8+ plasma cells have recently
been reported in the context of an
analysis of bone marrow plasma cells (50) 
and are likely a long-lived plasma cell popu- 
lation that is quiescent and tissue resident. 
Here we expand their tissue distribution to 
the liver and gut and describe their specific 
clonal distribution pattern. 

B cells have an additional source of variabil- 
ity due to VDJ recombination, somatic hyper- 
mutation, and class switching, which can relate 
to cell phenotype. Therefore, we performed 
targeted enrichment and sequencing of B cell 
receptor (BCR) transcripts to assess isotypes, 
hypermutation levels, and clonal architecture 
of the B cell populations identified above. 
This analysis revealed an isotype and sub- 
class usage pattern that related to cellular 
phenotype (fig. S19A). As expected, naive 
B cells mainly showed a subclass prefer- 
ence for immunoglobulin M (IgM) and IgD. 
Notably, while evidence of class switching 
to IgA1 and IgG1 was seen within memory
B cells (including ABCs), plasmablasts and
plasma cells also showed class switching to
IgA2 and IgG2. 

To determine to what extent this isotype 
subclass bias correlated with the tissue of 
origin, we assessed each cell state indepen- 
dently (requiring a minimum cell count of 
50). Memory B cells showed a bias toward 
IgA1 in the mesenteric lymph nodes, and
toward IgA2 in the ileum, where Peyer’s 
patches are found (Fig. 3E). In the plasma 
cell compartment, we found an even more 
pronounced preference for IgA2 in the gut 
region (specifically in the jejunum lamina 
propria), consistent with the known dom- 
inance of this isotype at mucosal surfaces 
(Fig. 3E). Of note, plasma cells in the bone 
marrow, liver, and spleen were composed 
of >20% IgG2 subclass. With more limited 
numbers, we reported isotype distributions 
across tissues for other B cell populations 
(fig. S19, B and C). ABCs were dominated by 
IgM in both the spleen and lung-draining lymph 
nodes, consistent with previous reports (51).
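
The per-tissue isotype comparison described above can be illustrated with a minimal sketch. It assumes each cell of a single cell state is given as a hypothetical (tissue, isotype) pair and uses only the stated minimum of 50 cells; the function name, input layout, and toy data are illustrative and are not taken from the study's code.

```python
from collections import Counter, defaultdict

MIN_CELLS = 50  # minimum cell count per tissue, as stated in the text


def isotype_fractions(cells, min_cells=MIN_CELLS):
    """Return {tissue: {isotype: fraction}} for tissues with enough cells.

    `cells` is an iterable of (tissue, isotype) pairs for ONE cell state
    (for example, plasma cells). This input layout is an assumption.
    """
    counts = defaultdict(Counter)
    for tissue, isotype in cells:
        counts[tissue][isotype] += 1

    fractions = {}
    for tissue, per_isotype in counts.items():
        total = sum(per_isotype.values())
        if total < min_cells:
            continue  # skip tissues below the minimum cell count
        fractions[tissue] = {iso: n / total for iso, n in per_isotype.items()}
    return fractions


# Toy data, invented for illustration only.
toy_cells = (
    [("jejunum", "IgA2")] * 40
    + [("jejunum", "IgA1")] * 20
    + [("spleen", "IgG2")] * 10  # only 10 cells, so this tissue is dropped
)
toy_fractions = isotype_fractions(toy_cells)
```

Applying the threshold per cell state and per tissue keeps sparsely sampled combinations from contributing unstable proportions.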

Somatic hypermutation levels were, as ex- 
pected, lowest in naive B cells and highest in 
plasma cells (fig. S19D). Between isotypes and
subclasses, somatic hypermutation did not 
differ significantly. Nonetheless, there was a 
tendency toward higher mutation rates in 
the distal classes IgG2, IgG4, and IgA2, which 
are located downstream in the IgH locus and can
thus accumulate more mutations during se- 
quential switching (52) (Fig. 3F). We also 
explored the occurrence of sequential class 
switching events in our data by assessing the 
isotype frequency among expanded clonotypes 
(>10 cells). Mixed isotype clones were rare in 
our data (fig. S19E).

Next, we evaluated the distribution of ex- 
panded clones across tissues and cell types 
and found three major groups of clones—those 
present in only two tissues, three to four tis- 
sues, or five or more tissues, respectively (Fig. 
3G), similar to previously reported patterns of 
B cell clonal tissue distribution (53). While 
the clones restricted to two tissues, typically 
between the spleen and the liver or bone mar- 
row, were enriched in plasma cells, those dis- 
tributed across more than five tissues, including 
lymph nodes, were over-represented in memory 
B cells. Together, these findings suggest that 
tissue-restricted clones may represent long- 
term immunological memory maintained by 
long-lived plasma cells resident in the bone 
marrow and spleen as well as liver in our data. 

Overall, we characterized nine cell states in 
the B cell compartment and gained insights 
from in-depth characterization of both naive 
and memory B cell as well as plasma cell 
subsets. We identified distinct clonal distribu- 
tion patterns for the more tissue-restricted 
long-lived quiescent plasma cells versus the 
broad tissue distribution of classical memory 
B cell clones.

Fig. 4. Tissue compartmentalization and site-specific adaptations of T cells
and innate lymphoid cells. (A) UMAP visualization of T cells and ILCs across
human tissues colored by cell types. (B) Dot plot for expression of marker genes
of the identified immune populations. Color represents maximum-normalized
mean expression of cells expressing marker genes, and size represents the
percentage of cells expressing these genes. (C) UMAP visualization of T cells and
ILCs colored by tissues. (D) Heatmap showing the distribution of each T cell
or ILC population across different tissues. Cell numbers are normalized within
each tissue and later calculated as proportions across tissues. Only tissues
containing >50 ILC or T cells in at least two donors were included. Asterisks
mark significant enrichment in a given tissue relative to the remaining
tissues (Poisson regression stratified by donors, P < 0.05 after BH correction).
(E and F) smFISH visualization of CD3D, CD8A, and CRTAM transcripts,
validating the tissue-resident memory CD8+ T cell population in the liver and
lung-draining lymph nodes. (G) TCR repertoire analysis of T cells across tissues.
Stacked bar plot shows the fraction of cells in a given cluster binned by
clonotype size. (H) Heatmap showing the repertoire overlap between expanded
clones (>1 cell) across tissues and donors as determined by Jaccard distance.


scRNA-seq analysis of T cells and innate
lymphocytes reveals lineage- and
tissue-specific subsets

For annotation of the T cell and innate 
lymphocyte compartment, CellTypist iden- 
tified 18 cell types (fig. S20A). After manual 
inspection, these clusters were further divided
into additional subtypes (e.g., for cytotoxic
T cells) (Fig. 4A and fig. S20B). As
described above for the myeloid and B cell 
compartments, identities of the derived 
cell types were cross-validated in the im- 
mune compartment of the CellTypist train- 
ing datasets (fig. S20, C and D) and the two 
independent studies of gut (27) and lung (28) 
(fig. S20, E and F).

Naive/central memory CD4+ T cells were
transcriptionally close to naive CD8+ T cells
as defined by high expression of CCR7 and
SELL and were mainly found in lymphoid
sites (Fig. 4B). Other CD4+ T cells identified
included T follicular helper (TFH) cells expressing
CXCR5; regulatory T cells (Tregs) expressing
FOXP3 and CTLA4; effector memory
CD4+ T cells; and tissue-resident memory TH1
and TH17 cells expressing CCR9, ITGAE, and
ITGA1 found largely in intestinal sites (jejunum
and ileum) and lungs (Fig. 4, B to D).
Within the memory CD8+ T compartment,
we found three major subsets: Trm_gut_CD8
(resident memory T cell, TRM), Tem/emra_CD8
(effector memory T cell, TEM; effector
memory reexpressing CD45RA, TEMRA), and
Trm/em_CD8. These subsets were characterized
by differential expression of the chemokine
receptors CCR9 and CX3CR1 and the
activation marker CRTAM (Fig. 4B). The
Trm_gut_CD8 population (CCR9+) expressed
the tissue-residency markers ITGAE and
ITGA1, encoding CD103 and CD49a, respectively,
and localized to intestinal sites (Fig. 4, B
to D). By contrast, the Tem/emra_CD8 population
expressing CX3CR1 was found in blood-rich
sites (blood, bone marrow, lung, and liver)
and was excluded from lymph nodes and gut
(Fig. 4, C and D), consistent with previous
flow cytometry analysis of TEMRA cells (54)
and results showing CX3CR1+CD8+ T cells
as blood-confined and absent from thoracic
duct lymph (55). The Trm/em_CD8 population
expressed high levels of CRTAM, a gene
previously shown to be expressed by TRM cells
(56), and was found in spleen, bone marrow,
and to a lesser extent in lymph nodes and
lungs. This may be a resident population
more prevalent in lymphoid sites (76). We
validated and mapped the Trm/em_CD8 population
using smFISH in the liver (Fig. 4E)
and lung-draining lymph nodes (Fig. 4F).
Furthermore, we validated all three memory
CD8+ T cell populations at the protein level
using flow cytometry of cells purified from
human spleen (fig. S21). Although we could
validate CRTAM at the RNA level by smFISH, 
the protein could not be detected without stim- 
ulation, suggesting that CRTAM is subject to 
posttranslational regulation upon T cell recep- 
tor (TCR) activation. These three distinct pop- 
ulations represent different states of tissue 
adaptation and maturation between effector 
memory and tissue-resident T cell mem- 
ory states. 

We also detected invariant T cell subsets
such as MAIT cells, characterized by expression
of TRAV1-2 and SLC4A10, and two populations
of γδ T cells: Trm_Tgd and Tgd_CRTAM+. The
CCR9+ Trm_Tgd population was found in the gut
and expressed the tissue-residency markers
ITGAE and ITGA1, whereas the Tgd_CRTAM+
population overexpressed CRTAM, IKZF2
(encoding HELIOS), and the integrin molecule
ITGAD (encoding CD11d) and was found
primarily in the spleen, bone marrow, and
liver (Fig. 4, B to D, and fig. S22, A and B). We
validated the latter population by quantitative
PCR (qPCR) of flow-sorted CD3+TCRγδ+
and CD3+TCRαβ+ cells from cryopreserved
spleen samples from three donors (fig. S22, C
and D). As a small fraction of αβ T cells, marked
by low expression of CD52 and CD127, were
also noted to express ITGAD, the CD3+TCRαβ+
population was split into CD52−CD127− and
CD52+CD127+ subpopulations. In keeping with
our scRNA-seq data, ITGAD expression was
high in CD3+TCRγδ+ and CD52−CD127− CD3+TCRαβ+
cells, providing additional evidence for the
specific expression of this integrin alpha
subunit in this subpopulation of γδ T cells.

Lastly, NK cells in our data were repre- 
sented by two clusters with high expression 
of either FCGR3A (encoding CD16) or NCAM1 
(encoding CD56). We also defined an ILC3 
population within a small cluster mixed with 
NK cells, via expression of markers including 
PCDH9 (Fig. 4, A and B). Analyses of the 
tissue distribution of these populations revealed
that, whereas most of the CD4+ T and
ILC3 cells were located in the lymph nodes 
and to some extent in the spleen, cytotoxic 
T and NK cells were more abundant in the 
bone marrow, spleen, and nonlymphoid tis- 
sues (Fig. 4, C and D). 
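
The tissue distribution heatmap (Fig. 4D) is described as normalizing cell numbers within each tissue and then calculating proportions across tissues. A minimal sketch of one reading of that two-step normalization follows; the function name, the nested-dictionary input, and the toy counts are assumptions made for illustration, not the study's code.

```python
def tissue_distribution(counts):
    """Two-step normalization of {cell_type: {tissue: n_cells}}.

    Step 1 divides each count by the total number of cells in its tissue.
    Step 2 rescales each cell type so its values sum to 1 across tissues.
    """
    tissues = sorted({t for per_tissue in counts.values() for t in per_tissue})
    tissue_totals = {
        t: sum(per_tissue.get(t, 0) for per_tissue in counts.values())
        for t in tissues
    }

    result = {}
    for cell_type, per_tissue in counts.items():
        # Step 1: fraction of each tissue's cells belonging to this cell type.
        within = {
            t: (per_tissue.get(t, 0) / tissue_totals[t]) if tissue_totals[t] else 0.0
            for t in tissues
        }
        # Step 2: proportions across tissues for this cell type.
        row_sum = sum(within.values())
        result[cell_type] = {
            t: (v / row_sum if row_sum else 0.0) for t, v in within.items()
        }
    return result


# Toy counts, invented for illustration only.
toy_counts = {
    "NK": {"spleen": 30, "lymph_node": 10},
    "CD4_T": {"spleen": 70, "lymph_node": 190},
}
toy_dist = tissue_distribution(toy_counts)
```

Normalizing within tissue first prevents heavily sampled tissues from dominating the across-tissue proportions.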


TCR repertoire analysis shows clonal 
expansion and distribution patterns within and 
across tissues 


To understand T cell-mediated protection 
in more depth, we analyzed T cell clonal dis- 
tribution in a subset of the data within dif- 
ferent tissues of a single individual and 
across different individuals. Chain pairing 
analysis showed that cells from the T cell clusters
mostly contained a single pair of TCRαβ
chains (50 to 60%), with orphan (5 to 20%)
and extra (5 to 10%) chains present in small
fractions of cells (fig. S23A). Notably, the frequency
of extra α chains (extra VJ) was higher
than that of β chains (extra VDJ), potentially
because of more stringent and multilayered
allelic exclusion mechanisms at the TCRβ
locus compared with TCRα (57). As expected,
the NK and ILC clusters held no productive
TCR chains. Within the γδ T cell clusters, only
a small proportion had a productive TCR
chain, which may result from cytotoxic T cells
co-clustering with γδ T cells. We also carried
out γδ TCR sequencing in selected spleen,
bone marrow, and liver samples. The γδ TCR
sequencing data was subjected to a customized
analysis pipeline that we developed and
optimized using cellranger followed by contig
reannotation with dandelion (see materials
and methods), facilitating the full recovery of
γδ chains in our data. This analysis confirmed
that most of the productive γδ TCR chains
originated from the ITGAD-expressing γδ T cells
(fig. S23B), supporting the robust identification
of this population. The Trm_Tgd population
could not be confirmed by γδ TCR sequencing
owing to the lack of sample availability.
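
The chain pairing categories mentioned above (single pair, orphan, extra) can be made concrete with a small sketch. It assumes the number of productive VJ (α) and VDJ (β) chains per cell is already known; the category labels loosely follow the wording of the text, and the function is a simplified illustration rather than the scirpy implementation used in the study.

```python
from collections import Counter


def chain_pairing(n_vj, n_vdj):
    """Classify a cell by its counts of productive VJ and VDJ chains."""
    if n_vj == 0 and n_vdj == 0:
        return "no TCR"
    if n_vj == 0 or n_vdj == 0:
        # Only one chain type was detected, so the pair is incomplete.
        return "orphan VJ" if n_vj else "orphan VDJ"
    if n_vj == 1 and n_vdj == 1:
        return "single pair"
    if n_vj == 2 and n_vdj == 1:
        return "extra VJ"
    if n_vj == 1 and n_vdj == 2:
        return "extra VDJ"
    return "other"


# Toy (n_vj, n_vdj) counts per cell, invented for illustration only.
toy_cells = [(1, 1), (1, 1), (2, 1), (0, 1), (1, 0), (0, 0)]
toy_summary = Counter(chain_pairing(vj, vdj) for vj, vdj in toy_cells)
```

Tabulating these labels per cluster gives fractions comparable to those reported in the text.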

We next examined V(D)J gene usage in relation
to T cell identity. In the MAIT population,
we detected significant enrichment of TRAV1-2,
as expected (fig. S23C). Selecting only the
TRAV1-2+ cells (MAIT cluster and other clusters)
revealed a notable tissue-specific distribution
of TRAJ segments, with TRAJ33
in spleen and liver, TRAJ12 in liver, and
TRAJ29/TRAJ36 in jejunum (fig. S23D).
This suggests that there may be different
antigens for MAIT cells in the spleen, liver, and
gut corresponding to the different metabolomes
in these tissues. In addition, full analysis
of the TCR repertoire of the MAIT cells revealed
previously unappreciated diversity of V
segment usage in the β chain (fig. S23D).

13 May 2022

We then defined clonally related cells on the 
basis of identical CDR3 nucleotide sequences 
to investigate their TCR repertoires. Using this 
approach, we found that clonally expanded 
cells were primarily from the resident memory 
T cell compartment, including the TH1 and
TH17 populations mentioned above (Fig. 4G
and fig. S23E). As expected, these clonotypes 
were restricted to single individuals, and 
within an individual they were distributed 
across tissues and subsets (Fig. 4H and fig. 
S23, F to H). We found a small number of
isolated CD4+ T cell clones that shared Treg
and effector T cell phenotypes, possibly owing 
to low levels of plasticity or to (trans)differen- 
tiation from the same naive precursor cell in 
the periphery (fig. S23H). Focusing on the most 
expanded clonotypes (>20 cells), many were 
widespread across five or more tissues, support- 
ing the systemic nature of tissue-resident im- 
mune memory (Fig. 4G). Moreover, we found 
that several clonotypes present across tis- 
sues consisted of a mixture of cells from the 
Tem/emra_CD8 and Trm/em_CD8 populations
(fig. S23H), suggesting that a single naive
CD8+ T cell precursor can give rise to diverse
cytotoxic T cell states, which harbor immune 
memory across multiple nonlymphoid tissues, 
emphasizing the plasticity of phenotype and 
location within a clone. 
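
As a rough illustration of the clonotype analysis above, the sketch below groups cells by identical CDR3 nucleotide sequences, keeps expanded clonotypes, and compares tissues with the Jaccard index (the Jaccard distance named in the Fig. 4H caption is one minus this value). The record fields, the use of both chains as the grouping key, and the toy sequences are assumptions for illustration; the study itself used the scirpy package.

```python
from collections import defaultdict


def group_clonotypes(cells):
    """Group cells sharing identical CDR3 nucleotide sequences."""
    clonotypes = defaultdict(list)
    for cell in cells:
        key = (cell["cdr3_alpha_nt"], cell["cdr3_beta_nt"])
        clonotypes[key].append(cell)
    return clonotypes


def expanded(clonotypes, min_size=2):
    """Keep clonotypes with at least `min_size` cells (>1 cell by default)."""
    return {k: v for k, v in clonotypes.items() if len(v) >= min_size}


def jaccard_index(a, b):
    """Shared items divided by all items; the distance is 1 minus this."""
    a, b = set(a), set(b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


# Toy cells with made-up CDR3 sequences, for illustration only.
toy_cells = [
    {"tissue": "liver", "cdr3_alpha_nt": "TGTGCA", "cdr3_beta_nt": "TGCGCC"},
    {"tissue": "spleen", "cdr3_alpha_nt": "TGTGCA", "cdr3_beta_nt": "TGCGCC"},
    {"tissue": "spleen", "cdr3_alpha_nt": "TGTGGG", "cdr3_beta_nt": "TGCAGT"},
    {"tissue": "spleen", "cdr3_alpha_nt": "TGTGGG", "cdr3_beta_nt": "TGCAGT"},
    {"tissue": "liver", "cdr3_alpha_nt": "TGTATC", "cdr3_beta_nt": "TGCTCA"},
]
toy_expanded = expanded(group_clonotypes(toy_cells))

# Set of expanded clonotypes observed in each tissue.
by_tissue = defaultdict(set)
for key, members in toy_expanded.items():
    for cell in members:
        by_tissue[cell["tissue"]].add(key)

toy_overlap = jaccard_index(by_tissue["liver"], by_tissue["spleen"])
```

In this toy example, one of the two expanded clonotypes is shared between liver and spleen, so the index is 0.5.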

In summary, we have described 18 T or 
innate cell states in our data by integrating 
CellTypist logistic regression models, manual 
curation, and V(D)J sequencing. This analysis 
has yielded insights into the MAIT cell com- 
partment and its antigen receptor repertoire 
distribution that differed between spleen, 
liver, and gut. For the cytotoxic T cell memory 
compartment, there was broad sharing of clones 
across gut regions for Trm_gut_CD8, and
mixed Tem/emra_CD8 and Trm/em_CD8 T cell
clonotypes with broad tissue distributions. 


A cross-tissue updatable reference of immune 
cell types and states 


After focusing on individual immune com- 
partments, we next took a combined approach 
to better understand the immune landscape of 
selected tissues. As shown in Fig. 5A, each 
tissue has its own immune neighborhood. For
example, whereas spleen and lymph nodes are
rich in B cells, the composition of their myeloid
compartment varies. In particular, a large population
of erythrophagocytic macrophages,
known as red pulp macrophages, is evident
in the spleen (in keeping with their role in red
blood cell turnover), whereas lymph nodes
are rich in dendritic cells. As expected, bone
marrow uniquely contains progenitor populations.
Furthermore, lung and liver contain
notable numbers of monocytes, including
CX3CR1+ nonclassical monocytes, whereas these
cells are absent from the jejunum, perhaps reflecting
different degrees of vascularization.
In contrast, the jejunum has an abundance of
resident memory T cells (CD8+ T cells, TH1
cells, and TH17 cells) as well as plasma cells.

Fig. 5. A cross-tissue updatable reference of immune cell types and cell states. (A) Heatmap showing the distribution of manually curated cell types across
selected tissues. Cell numbers are normalized within each tissue and later calculated as proportions across tissues. Asterisks mark significant enrichment in a given
tissue relative to the remaining tissues (Poisson regression stratified by donors, P < 0.05 after BH correction). (B) Workflow for the iterative update of CellTypist
through the periodic incorporation of curated cell type labels.


Dominguez Conde et al., Science 376, eabl5197 (2022) 


Our long-term vision for CellTypist is to 
provide a reference atlas with deeply curated 
cell types publicly available to the community. 
Therefore, via a semiautomatic process, we fed 
the identities of the 41 immune cell types iden- 
tified in our dataset (including both shared and 
novel cell type labels) back into CellTypist, 
demonstrating how CellTypist can be updated 
and improved over time. Combined with the 
initial 91 cell types and states included in the 
reference datasets, CellTypist now comprises 
101 annotated cell types (Fig. 5B). 


Discussion 


Here, we present a multitissue study of im- 
mune cells across the human body in diverse 
organ donors. By sampling multiple organs
from the same individuals, which allows for
robust control of age, sex, medical history, drug 
exposure and sampling backgrounds, we reveal 
tissue-specific expression patterns across the 
myeloid and lymphoid compartments. 

We also introduce CellTypist, a publicly 
available and updatable framework for auto- 
mated immune cell type annotation that, in 
addition to identifying major cell types, can 
perform fine-grained cell subtype annotation— 
normally a time-consuming process that 
requires expert knowledge. We developed 
CellTypist by integrating and curating data 
obtained from 19 studies performed across 
a range of tissues, with in-depth immune 
cell analysis of 91 harmonized cell type labels. 
However, as demonstrated here, for example
in the γδ T cell compartment, manual curation
after automated annotation still has a
role to play in revealing specific cell subtypes 
that may be absent from the training set. To 
reduce the need for this manual curation, in 
the longer term, the CellTypist models will be 
periodically updated and extended to include 
further immune and nonimmune subpopula- 
tions as more data become available. 

Within the myeloid compartment, macro- 
phages showed the most prominent features 
of tissue specificity. Erythrophagocytic macro- 
phages in the liver and spleen shared features 
related to iron recycling (58) with macrophages 
in other locations, such as the mesenteric 
lymph nodes, suggesting that macrophages 
participate in iron metabolism across a range 
of tissues. In addition, we characterized subsets
of migratory dendritic cells (CCR7+), revealing
specific expression of CRLF2, CSF2RA,
and GPR157 in the lung and lung-draining
lymph nodes and expression of AIRE in the
mesenteric and lung-draining lymph nodes. 
These migratory dendritic cell states are 
interesting targets for future in-depth func- 
tional characterization in the context of al- 
lergy, asthma, and other related pathologies 
(59, 60). 

In the lymphoid compartment, we combined 
single-cell transcriptome and VDJ analysis, 
which allowed the phenotype of adaptive im- 
mune cells to be dissected using comple- 
mentary layers of single-cell genomics data. 
Of note, we detected a subset of memory 
B cells expressing ITGAX (CD11c) and TBX21
(T-bet) that resemble ABCs previously reported
to be expanded in aging (48), following
malaria vaccination (61), and in systemic
lupus erythematosus patients (62). In our data, 
these B cells did not show clonal expansion, 
and at least 50% showed IgM subclass, sug- 
gesting that they may be present at low levels 
in healthy tissues and expand upon challenge 
as well as aging. BCR analysis revealed isotype 
usage biased toward IgA2 in gut plasma cells, 
which may be related to structural differences 
(63) or higher resistance to microbial degrada- 
tion as compared with IgA1 (64). 

In the T cell compartment, our results pro- 
vided insights into the heterogeneity of T cell 
subtypes and their tissue adaptations. Notably,
we identified subsets of CD4+ TRM on the
basis of functional capacity for interferon-γ
or IL-17 production that were mostly localized
to intestinal sites, analogous to mouse CD4+
TRM generated from IL-17-producing effector
T cells in the gut (65). We also identified
different subsets of CD8+ TRM, including a
gut-adapted subset expressing CCR9, which
mediates homing to intestinal sites via binding
to CCL25 (66), and another TRM-like subset
more targeted to lymphoid sites. TCR
clone sharing between memory subtypes of
CD8+ T cells suggests their origin from a
common precursor, or their differentiation
or conversion during migration or maintenance,
such as conversion of TEM cells to
TRM cells (56). We also identified distinct
subsets of γδ T cells on the basis of tissue-
specific gene expression patterns, showing 
distinct integrin gene expression and tissue 
distributions. 

In summary, using this dataset of nearly 
360,000 single-cell transcriptomes (of which 
~330,000 were immune cells) from donor- 
matched tissues from 12 deceased individu- 
als, we have shown how a combination of 
CellTypist-based automated annotation, expert- 
driven cluster analysis, and antigen receptor 
sequencing can synergize to dissect specific 
and functionally relevant aspects of immune 
cells across the human body. We have re- 
vealed previously unrecognized features of 
tissue-specific immunity in the myeloid and 
lymphoid compartments, and have provided 
a comprehensive framework for future cross- 
tissue cell type analysis. Further investigation 
of human tissue-resident immunity is needed 
to determine the effect of important covariates 
such as underlying critical illness, donor age, 
and gender, as well as considering the im- 
mune cell activation status, to gain a defining 
picture of how human biology influences 
immune functions. Our deeply character- 
ized cross-tissue immune cell dataset has 
implications for the engineering of cells for 
therapeutic purposes and addressing cells to 
intended tissue locations, as well as for under- 
standing tissue-specific features of infection 
as well as distinct modes of vaccine delivery 
to tissues. 


Materials and methods summary 
Tissue acquisition, processing, and 
single-cell sequencing 


Tissue was obtained from deceased organ 
donors through the Cambridge Biorepository 
for Translational Medicine (CBTM,
https://www.cbtm.group.cam.ac.uk/), REC 15/EE/0152.
Detailed sample locations can
be found in Fig. 1, and protocols are described 
in detail in supplementary materials. Addi- 
tional tissue samples were from Columbia 
University and were obtained from deceased 
organ donors at the time of organ acquisition 
for clinical transplantation through an ap- 
proved protocol and material transfer agree- 
ment with LiveOnNY. 

Six donors were processed with a uniform 
protocol at Cambridge University, where solid 
tissues were cut into small pieces and then 
homogenized with two rounds of enzymatic 
digestion at 37°C for 15 min with mixing. 
The remaining six donors were subjected to 
a tissue-adapted protocol with the aim of im- 
proving immune cell recovery, and this pro- 
tocol was harmonized as closely as possible 
between the two collection sites.

For scRNA-seq experiments, single cells were
loaded onto the channels of a Chromium chip 
(10x Genomics). cDNA synthesis, amplification, 
and sequencing libraries were generated using 
either the Single Cell 5’ Reagent (v1 and v2) 
(Cambridge University) or 3’ Reagent (v3) 
(Columbia University) Kit. TCRαβ, BCR, and
TCRγδ paired VDJ libraries were prepared
from samples made with the 5’ Reagent kit. 
All libraries were sequenced on either a 
HiSeq 4000 or NovaSeq 6000 instrument. 


scRNA-seq and scVDJ-seq data analysis 


scRNA-seq data was aligned and quantified 
using the cellranger software (version 6.1.1, 10x 
Genomics). Cells from hashtagged samples 
were demultiplexed using Hashsolo (67). 
Cells with fewer than 1000 unique molecular 
identifier counts and 600 detected genes 
were excluded. Doublets were detected using 
Scrublet (68). Downstream analyses from
data normalization to graph-based clustering 
were performed using Scanpy (version 1.6.0) 
(69), with details described in supplementary 
materials. Data integration was done using 
BBKNN (70) and scVI (71), and the results were
compared using kBET (72). 
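
The quality control thresholds above can be sketched in plain Python. The sketch assumes that a cell is retained only if it meets both thresholds (at least 1000 UMI counts and at least 600 detected genes), which is one reading of the sentence; the function name and the dictionary input are hypothetical, and the study itself performed these steps with Scanpy.

```python
MIN_UMI_COUNTS = 1000  # threshold stated in the text
MIN_GENES = 600        # threshold stated in the text


def passes_qc(cell_counts, min_umis=MIN_UMI_COUNTS, min_genes=MIN_GENES):
    """Return True if a cell meets both thresholds.

    `cell_counts` maps gene name to UMI count for a single cell.
    Requiring both thresholds is an assumption about the filtering rule.
    """
    total_umis = sum(cell_counts.values())
    detected_genes = sum(1 for n in cell_counts.values() if n > 0)
    return total_umis >= min_umis and detected_genes >= min_genes


# Toy cells, invented for illustration only.
deep_cell = {f"gene{i}": 2 for i in range(700)}     # 1400 UMIs, 700 genes
shallow_cell = {f"gene{i}": 1 for i in range(700)}  # 700 UMIs, 700 genes
narrow_cell = {f"gene{i}": 3 for i in range(500)}   # 1500 UMIs, 500 genes

toy_kept = [passes_qc(c) for c in (deep_cell, shallow_cell, narrow_cell)]
```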

Single-cell TCR sequencing and single-cell 
BCR sequencing data were aligned and quantified
using the cellranger-vdj software (versions
2.1.1 and 4.0, respectively). For TCRγδ, we
implemented a customized pipeline
(https://sc-dandelion.readthedocs.io/en/latest/notebooks/gamma_delta.html)
owing to cellranger
being tuned toward alpha/beta TCR chains.
Single-cell TCR sequencing analysis, including
productive TCR chain pairing and clonotype
detection, was performed using the
scirpy package (73). 


CellTypist 


Details of CellTypist, including cross-data cell 
type label harmonization and automated cell 
annotation, can be found in the supplementary 
text in the supplementary materials. Briefly, 
immune cells from 20 tissues of 19 studies were 
collected and harmonized into consistent 
labels. These cells were split into equal-sized 
mini-batches, and these batches were sequentially
trained by the L2-regularized logistic
regression using stochastic gradient descent 
learning. Feature selection was performed to 
choose the top 300 genes from each cell type, 
and the union of these genes was supplied as 
the input for a second round of training. 
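
The feature selection step described above can be sketched as follows. The sketch assumes that genes are ranked per cell type by the absolute value of their first-round model coefficient, which the text does not specify; the function name, the coefficient layout, and the toy values are hypothetical and do not reproduce the CellTypist code.

```python
def select_features(coefficients, genes, top_n=300):
    """Return the sorted union of the top-ranked genes for each cell type.

    `coefficients` maps cell type to a list of per-gene coefficients in the
    same order as `genes`. Ranking by absolute coefficient is an assumption.
    """
    selected = set()
    for coefs in coefficients.values():
        ranked = sorted(
            range(len(genes)), key=lambda i: abs(coefs[i]), reverse=True
        )
        selected.update(genes[i] for i in ranked[:top_n])
    return sorted(selected)


# Toy coefficients, invented for illustration only.
toy_genes = ["CD3D", "CD19", "MS4A1", "NKG7"]
toy_coefs = {
    "T cell": [2.0, -0.1, 0.0, 0.5],
    "B cell": [-0.2, 1.5, 1.8, 0.0],
}
toy_selected = select_features(toy_coefs, toy_genes, top_n=2)
```

The union is then used as the input gene set for the second round of training, so a gene that is informative for any single cell type is retained.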


Single-molecule FISH, flow cytometry, qPCR, 
and immunofluorescence 


For smFISH, samples were run using the 
RNAscope 2.5 LS fluorescent multiplex assay 
(automated). Slides were imaged on the Perkin 
Elmer Opera Phenix High-Content Screening 
System, in confocal mode with 1-μm z-step size,
using 20× (NA 0.16, 0.299 μm per pixel) and
40× (NA 1.1, 0.149 μm per pixel) water-immersion
objectives.
For flow cytometry, mononuclear cells were 
either stained ex vivo or after activation with 
phorbol 12-myristate 13-acetate/ionomycin 
for 2 hours. Cells were stained with the live/ 
dead marker Zombie Aqua for 10 min at 
room temperature and then washed with 
phosphate-buffered saline (PBS) and 0.5% 
fetal calf serum (FCS), with the CD8 and B cell 
panels of antibodies. 
qPCR was performed in three spleen sam- 
ples. Cells were stained with the live/dead 
marker Zombie Aqua for 10 min at room 
temperature and then washed with PBS
and 0.5% FCS, followed by staining with the
antibodies at 4°C for 45 min. Cell sorting 
was performed on a BD Fusion 4 laser sorter, 
and RNA was extracted using a Zymo Re- 
search RNA micro kit. 

For immunofluorescence, samples were fixed 
in 1% paraformaldehyde for 24 hours followed 
by 8 hours in 30% sucrose in PBS and were 
stained for 2 hours at room temperature with 
the appropriate antibodies, washed three times 
in PBS, and mounted in Fluoromount-G 
(Southern Biotech). Images were acquired 
using a TCS SP8 (Leica, Milton Keynes, UK) 
confocal microscope.

INTRODUCTION: Bacteria within the gut continuously
adapt their gene expression to environmental
conditions that are associated with
diet, health, and disease. Noninvasive measure- 
ments of bacterial gene expression patterns 
throughout the intestine are important to under- 
stand in vivo microbiota physiology and patho- 
physiology. Current methods do not offer sufficient 
information about transient or proximal events 
within the intestine without using indirect or 
invasive approaches that disturb normal phys- 
iology and are inapplicable to clinical practice. 


RATIONALE: Transcriptional recording by 
CRISPR spacer acquisition from RNA (Record- 
seq) enables engineered bacteria to contin- 
uously record the history of gene expression in 
a population of bacteria. Over time, snippets of 
intracellular RNA are converted into DNA and 
integrated as a historical record of spacer se- 
quences within CRISPR arrays through the ac- 
tion of an integration complex that contains a 
reverse transcriptase Cas1 fusion protein (RT- 
Cas1) and Cas2. Here, using a refined Record-seq 
methodology, we used transcriptional recording 
Escherichia coli sentinel cells to reveal intesti- 
nal and microbiota physiology under different 
dietary and disease contexts along the length 
of the unmanipulated mouse intestine. 


RESULTS: We used transcriptional recording 
sentinel cell technology in the gastrointestinal 
tract of germ-free and gnotobiotic mice to as- 
sess how the DNA record of spacer sequences in 
fecal samples uncovered distinct transcriptional 
records during passage from the proximal to 
the distal intestine, which depended on diet, 
inflammation, and microbe-microbe interactions. 

Transcriptional recording sentinel cells noninvasively report interactions with diet, host, other microbes, 
and pathological environments. Throughout intestinal transit, sentinel cells capture information about transient 
mRNA expression into plasmid-encoded CRISPR arrays through the action of a reverse transcriptase Cas1-Cas2 
complex. This information is retrieved by means of fecal sampling and deep sequencing followed by computational 
analyses. Barcoded CRISPR arrays enable transcriptional profiling of isogenic bacteria coinhabiting the intestine. 

Sentinel cells retrieved from feces of stably 
colonized germ-free mice accumulated new 
spacers over time and during intestinal tran- 
sit. Fecal Record-seq profiles were distinct for 
mice on chow diet versus starch or fat diets. 
Transcriptional records of these diets were pre- 
served in fecal spacer sequences even 2 weeks 
after a dietary switch, whereas diet-specific fe- 
cal bacterial RNA-sequencing (RNA-seq) pro- 
files were rapidly lost. Direct measurements 
showed that Record-seq efficiently captured 
proximal transient transcriptional events. This 
included evidence of different carbon source 
preferences and bacterial hexuronate metabo- 
lism through the Entner-Doudoroff pathway 
under conditions of restricted carbon source 
availability, which was then verified through 
competitive colonization of wild-type (WT) 
E. coli and a ΔidnK/ΔgntK mutant defective in 
hexuronate catabolism. In addition to informa- 
tion about carbon source preference and me- 
tabolism, the transcriptome-scale records also 
provided evidence of an acid-stress response 
that was associated with a lowered pH in the 
cecum of starch-fed mice. In a dextran sulfate 
sodium (DSS) colitis model, sentinel cells re- 
corded transcriptional alterations consistent 
with reduced anaerobic metabolism, a stringent 
response, and increased oxidative and mem- 
brane stress. Cocolonization with Bacteroides 
thetaiotaomicron revealed likely cross-feeding 
of E. coli from records of uptake and metab- 
olism of fiber-derived saccharides liberated by 
B. thetaiotaomicron glycoside hydrolases. 

Record-seq was also able to capture diet- 
specific signatures in mice colonized with a 
12-organism model microbiota. Moreover, by 
using barcoded CRISPR arrays, we could show 
that Record-seq can be multiplexed in several 
strains of the same bacterial species that co- 
colonize the intestine, thus elucidating the com- 
pensatory response of a single-gene mutant to 
competition with the WT strain. 


CONCLUSION: Transcriptional recording senti- 
nel cells function in vivo in the mouse intestine 
and record transcriptome-scale information 
about diet, disease, and microbial interactions 
integrated along the length of the intestinal 
tract over time. Transcriptional recording en- 
ables noninvasive measurement of the intes- 
tinal tract with potential for biomedical research 
and future biomedical diagnostic applications. 


Transcriptional recording by CRISPR spacer acquisition from RNA endows engineered Escherichia coli with 
synthetic memory, which through Record-seq reveals transcriptome-scale records. Microbial sentinels 
that traverse the gastrointestinal tract capture a wide range of genes and pathways that describe interactions 
with the host, including quantitative shifts in the molecular environment that result from alterations in the 
host diet, induced inflammation, and microbiome complexity. We demonstrate multiplexed recording using 
barcoded CRISPR arrays, enabling the reconstruction of transcriptional histories of isogenic bacterial 
strains in vivo. Record-seq therefore provides a scalable, noninvasive platform for interrogating intestinal and 
microbial physiology throughout the length of the intestine without manipulations to host physiology 
and can determine how single microbial genetic differences alter the way in which the microbe adapts 
to the host intestinal environment. 


Engineered microorganisms that harbor 
molecular recording technologies enable 
the conversion of cellular histories into 
heritable nucleotide sequence archives 

encoded in DNA. Current recording tools— 
based on recombinases, single-stranded DNA 
recombineering, CRISPR-Cas9, and CRISPR 
spacer acquisition—are emerging as valuable 
platforms for encoding diverse biological fea- 
tures into DNA, including environmental metab- 
olite concentration, cell lineage, gene expression, 
and horizontal gene transfer (1-7). These tech- 
nologies can illuminate complex dynamic cel- 
lular processes but have largely only been 
deployed in vitro. An unmet need is to lever- 
age molecular recording to engineer sentinel 
cells capable of traversing the gastrointestinal 
tract and report on microbial and host physi- 
ology in the otherwise inaccessible intestinal 
lumen. Current sentinel cell technologies use 
biosensors to report on single biomolecules 
(8-13) but fall considerably short of capturing 
the complexity of the mammalian gastrointes- 
tinal tract. 

The contents of the mammalian intestine 
are exposed to and shape dramatically differ- 
ent luminal environments during transit and 
colonization (14). The microorganisms that 
inhabit the small intestine and colon adapt 
their gene expression to environmental changes 
associated with nutrients and pathological 
states. Metatranscriptomics from fecal bacte- 
ria are largely uninformative about adapta- 
tions to the proximal luminal environment 
(15), sampling of which requires invasive or 
disruptive procedures or use of devices that 
cannot preserve transient signals (16). Because 
metabolites and RNA are short-lived, omics- 
based measurements of transient stimuli only 
yield a snapshot of highly dynamic processes. 
An integrated measure of intestinal function 
therefore requires a noninvasive system that is 
able to sample a wide range of conditions and 
to preserve proximal transient signals in fecal 
samples. 

Our overriding goal in this work was to use 
molecular recording to establish a scalable, 
noninvasive sentinel cell system for assess- 
ing intestinal and microbial physiology. To 
achieve this, we used Record-seq (17), which 
uses the CRISPR spacer adaptation complex 
of Fusicatenibacter saccharivorans (FsRT-Cas1- 
Cas2) to acquire snippets of cellular RNAs 
as DNA within CRISPR arrays that can later 
be sequenced to reveal memory of past mi- 
crobial gene expression. Record-seq captures 
transcriptome-scale information about the in- 
testinal environment, recording the differ- 
ences in microbial-host interactions according 
to diet, disease, or alterations of microbiota 
composition. 


Results 
Transcriptional recording sentinel cells acquire 
transcriptional records within the mouse gut 


To establish transcriptional recording as a non- 
invasive recording tool in the mouse intestine, 
we orally gavaged germ-free C57BL/6(J) mice 
with Escherichia coli MG1655 carrying an an- 
hydrotetracycline (aTc)-inducible transcrip- 
tional recording plasmid (Fig. 1A). Comparisons 
between Record-seq on sequentially collected 
fecal samples and contents of the ileum, ce- 
cum, and colon revealed that sentinel cells 
acquired increasing transcriptional records 
(E. coli genome-aligning spacers) throughout 
time and position along the intestinal tract 
(Fig. 1, B and C, and fig. S1, A and B) and 
according to aTc concentration (fig. S1C). The 
functional characteristics of FsRT-Cas1-Cas2 
(fig. S1, D to H) were consistent with our pre- 
vious in vitro experiments (17, 18). 


Transcriptome-scale recording of complex and 
dynamic intraluminal environments 


We next assessed the capacity of transcrip- 
tional recording sentinel cells to record differ- 
ences in the intestinal environment by varying 
the animals’ diet. Mice monocolonized with 
sentinel cells were fed one of three diets: a 
standard chow or a purified diet based on 
either starch or fat (referred to as starch or 
fat diets below). Starting from day 7, all groups 
received the chow diet (Fig. 1D and table S1). 
RNA-sequencing (RNA-seq) and Record-seq of 
fecal samples enabled the characterization of 
transcriptional changes and the stability of the 
recorded information. Before the diet switch, 
both RNA-seq and Record-seq readily distin- 
guished the diet groups (fig. S2, A and B). After 
the switch to chow, the transcriptional signa- 
tures corresponding to starch or fat diets were 
rapidly lost with RNA-seq but not Record-seq 
(Fig. 1, E and F, and fig. S2, C to E). Thus, RNA-seq 
represents a snapshot measurement, whereas 
Record-seq durably reveals past transcriptional 
information. 

To verify the reproducibility of Record-seq, 
we replicated the first 14 days in an indepen- 
dent experiment, largely confirming our pre- 
vious observations (fig. S3, A to H). We also 
directly compared the two experiments and 
found a 95% overlap in the regulation of 
Record-seq differentially expressed genes (DEGs) 
(fig. S3I and tables S2 and S3). Thus, tran- 
scriptional recording sentinel cells reproduc- 
ibly record cellular transcriptomes and provide 
an archive of complex and dynamic features 
of different mammalian intestinal environ- 
ments in vivo. 
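
One way to quantify agreement between two experiments is the fraction of shared DEGs that change in the same direction. The sketch below is an assumption about how such an overlap could be computed; the source reports only the 95% figure, not the procedure, and the gene values here are invented for illustration.

```python
from typing import Dict


def direction_overlap(exp1: Dict[str, float], exp2: Dict[str, float]) -> float:
    """Fraction of DEGs shared by two experiments whose log2 fold changes agree in sign.

    Each argument maps gene -> log2 fold change for genes called as DEGs in that
    experiment. Returns NaN when the experiments share no DEGs.
    """
    shared = set(exp1) & set(exp2)
    if not shared:
        return float("nan")
    same_direction = sum(1 for g in shared if (exp1[g] > 0) == (exp2[g] > 0))
    return same_direction / len(shared)


# Hypothetical log2 fold changes from two independent experiments.
experiment_1 = {"gntK": 2.1, "edd": 1.8, "gadA": 1.2, "narL": -0.9}
experiment_2 = {"gntK": 1.7, "edd": 2.0, "gadA": -0.3, "narL": -1.1, "uxaC": 0.5}
overlap = direction_overlap(experiment_1, experiment_2)  # 3 of 4 shared genes agree
```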


Record-seq reveals adaptation of E. coli to 
intraluminal conditions 


Record-seq characterization of the genes and 
pathways altered in response to different 
diets delivered a detailed picture of E. coli's 
adaptation to intraluminal environments. DEGs 
(table S2) readily classified the three diet con- 
ditions upon hierarchical clustering (Fig. 2A 
and table S4), and pairwise comparisons be- 
tween each diet yielded hundreds of DEGs 
(Fig. 2B and fig. S4, A and B). 

Fig. 1. Transcriptional recording sentinel cells acquire transcriptional records within the mouse gut and 
preserve this information throughout time. (A) Schematic of experimental workflow for in vivo experiments 
with transcriptional recording sentinel cells. E. coli with recording plasmid encoding aTc-inducible 
FsRT-Cas1-Cas2 and a CRISPR array were orally gavaged into mice. Record-seq was performed on feces or 
intestinal contents. (B and C) Numbers of E. coli genome-aligning spacers (B) obtained from feces on the 
indicated days and (C) from the indicated intestinal sections on day 20 after gavage. (B) and (C) show 
mean ± SEM of n = 5 independent biological replicates. (D) Timeline of longitudinal in vivo recording 
experiment assessing the impact of diet on the intestinal E. coli transcriptome. Germ-free mice were 
supplied with aTc in the drinking water and gavaged with E. coli MG1655 transformed with the recording 
plasmid. Mice were fed a chow, fat, or starch diet starting 2 days before gavage until day 7. After day 7, all 
groups received the chow diet. Fecal RNA/Record-seq sampling is indicated, with day 7 samples being 
collected before changing the diets. Data for days 1 to 6, 8, 10, 11, 13, and 16 are shown in (B) and 
figs. S1A and S2C. (E and F) UMAP embedding of (E) RNA-seq or (F) Record-seq data from E. coli under chow 
(blue), fat (orange), or starch (green) diet on indicated days corresponding to (D). Dot sizes indicate 
successive time points; n = 5 independent biological replicates. Count thresholds were 10* (Record-seq) and 
10° (RNA-seq). Outliers were excluded on the basis of modified z-score and relative deviation from the mean. 
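
The DEG calls discussed in the text can be illustrated with a simple threshold filter. This sketch assumes the cutoffs quoted in the Fig. 2B legend (adjusted P < 0.1, log2 fold change > 1.5) and assumes the fold-change cutoff is applied to the absolute value; the statistical test that produces these values is not reproduced, and the function name and example numbers are hypothetical.

```python
from typing import Dict, Set, Tuple


def call_degs(
    results: Dict[str, Tuple[float, float]],
    padj_max: float = 0.1,
    lfc_min: float = 1.5,
) -> Set[str]:
    """Return genes passing both the adjusted P value and fold-change cutoffs.

    `results` maps gene -> (log2 fold change, adjusted P value).
    Applying the cutoff to |log2FC| is an assumption of this sketch.
    """
    return {
        gene
        for gene, (lfc, padj) in results.items()
        if padj < padj_max and abs(lfc) > lfc_min
    }


# Made-up statistics for four genes mentioned in the text.
toy_results = {
    "gntT": (2.4, 0.001),   # passes both cutoffs
    "eda": (1.6, 0.2),      # fold change passes, adjusted P does not
    "gadB": (-1.9, 0.05),   # passes both (down-regulated)
    "rpsA": (0.4, 0.01),    # adjusted P passes, fold change does not
}
degs = call_degs(toy_results)
```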


Pathway enrichment analysis (tables S5 and 
S6) (19) revealed a number of diet-dependent 
shifts in a wide range of cellular behaviors 
among the diet conditions (Fig. 2C and fig. S4, 
C and D). For further analysis, we focused on 
a comparison of chow versus starch diets. In 
starch-fed animals, we found signals indica- 
tive of nitrate use as an electron acceptor for 
anaerobic metabolism (narL), adaptation to 
low pH (gad), potassium uptake (kdpDE), 
and carbohydrate utilization. In chow-fed ani- 
mals, E. coli used diverse carbon sources as 
suggested by overrepresentation of utilization 
pathways for galactonate, glucarate, galacta- 
rate, sulfoquinovose, arabinose, galactose, ribose, 
and fucose (Fig. 2C and fig. S4E). These find- 
ings are in line with the diverse composition of 
the chow diet, which is enriched in plant mate- 
rial (table S1), as well as the previously de- 
scribed adaptation of intestinal symbionts to 
diverse polysaccharide carbon sources (20, 21). 

Record-seq characterization of mice on the 
starch diet revealed the metabolic adaptation 
of E. coli in vivo to more restricted carbon 
source availability. The sugar acids galacturonate 
and gluconate are present in the host 
mucus and were shown to be used as carbon 
sources through the Entner-Doudoroff pathway 
(EDP) for E. coli colonization in streptomycin-treated 
mice (22-24). In this study, we could 
use Record-seq directly in an unperturbed system 
in which the only manipulation was a diet 
change, revealing that gluconate and galacturonate 
are alternative carbon sources in the 
face of nutritional limitation as shown by enrichment 
for their catabolic pathways (Fig. 2C 
and fig. S4F). These changes included key 
genes of gluconate metabolism, which were 
strongly up-regulated in E. coli from starch-fed 
mice compared with chow-fed animals, such 
as the low- and high-affinity gluconate transporters 
(gntU and gntT, respectively), the 
permease for 2-keto-3-deoxygluconate (kdgT), 
gluconate kinase (gntK), as well as the central 
enzymes of the EDP (edd and eda) (Fig. 2D). 

Fig. 2. Record-seq reveals transcriptional changes describing the adaptation of E. coli to diet-dependent 
intraluminal environments. (A to C) Record-seq data from day 7 feces of mice fed a chow (blue), fat 
(orange), or starch (green) diet. (A) Heatmap showing hierarchical clustering by using 1183 differentially 
expressed genes (DEGs). z-score standardized gene-aligning spacer counts are shown. (B) Volcano plot 
with DEGs (Padj < 0.1, log2FC > 1.5) indicated. (C) Pathways and transcriptional and translational 
regulators enriched per diet group by use of EcoCyc. Dot sizes indicate gene numbers detected as 
significantly up-regulated for the respective pathway. (D) Box plot showing vst-transformed E. coli 
genome-aligning spacer counts for selected genes involved in gluconate metabolism corresponding 
to the indicated diets. (E) Competitive colonization experiment. Germ-free mice fed either a 
chow or starch diet were orally gavaged with a 1:1 mixture of WT E. coli MG1655 and E. coli 
MG1655 ΔidnK/ΔgntK (mut). Competitive indices were calculated from fecal recoveries as the 
ratio of mutant to WT CFU (mean ± SEM, n = 5 independent biological replicates, 
P = 3.93 x 10° likelihood ratio test; representative result of two independent experiments). 
(A) to (D) correspond to Fig. 1D; n = 5 

To confirm gluconate carbon source adaptation 
directly, we carried out competitive colonizations 
with wild-type (WT) E. coli MG1655 
and mutant E. coli MG1655 ΔidnK/ΔgntK (Fig. 
2E). This mutant is unable to catabolize gluconate 
because it lacks both isoforms of the 
gluconate kinase, so we expected it to suffer 
from a substantial competitive disadvantage 
compared with WT cells. The competitive disadvantage 
of E. coli MG1655 ΔidnK/ΔgntK 
relative to the WT cells was nearly 10-fold greater 
in starch- versus chow-fed mice (Fig. 2E), suggesting 
that E. coli becomes more dependent 
on sugar acids from the host mucus purely 
because of carbon source changes. Thus, 
Record-seq delivers a detailed assessment of 
individual genes and entire pathways altered in 
E. coli's adaptation to different diet-dependent 
intraluminal environments without confounding 
manipulations. 
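
The competitive index used in this experiment is the ratio of mutant to WT colony-forming units (CFU) recovered from feces. The sketch below shows that calculation and the fold difference between diets on invented CFU counts; it assumes the 1:1 inoculum stated in the legend, so no correction for the input ratio is applied, and it does not reproduce the likelihood ratio test.

```python
from statistics import mean
from typing import List, Tuple


def competitive_index(mutant_cfu: float, wt_cfu: float) -> float:
    """Ratio of mutant to wild-type CFU recovered from one sample."""
    if wt_cfu <= 0:
        raise ValueError("wild-type CFU must be positive")
    return mutant_cfu / wt_cfu


def mean_index(samples: List[Tuple[float, float]]) -> float:
    """Mean competitive index over (mutant CFU, WT CFU) pairs, one pair per mouse."""
    return mean(competitive_index(m, w) for m, w in samples)


# Hypothetical fecal recoveries (mutant CFU, WT CFU) for two mice per diet.
chow_samples = [(4.0e7, 8.0e7), (3.0e7, 6.0e7)]
starch_samples = [(4.0e6, 8.0e7), (3.0e6, 6.0e7)]

# An index below 1 means the mutant is outcompeted; the ratio of the two
# diet means expresses how much stronger the disadvantage is on starch.
fold_difference = mean_index(chow_samples) / mean_index(starch_samples)
```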


Sentinel cells capture the intestinal milieu 
and preserve transient features of the cecum 
and colon 


An important requirement of any intestinal 
reporting system that avoids potentially con- 
founding manipulations or animal sacrifice 
is the ability to capture information from the 
microbial biomass of inaccessible proximal 
gut sections. Given the ability of Record-seq 
to preserve information over time, we directly 
addressed whether the sentinel cells retain 
information along the longitudinal axis of 
the gastrointestinal tract by comparing fecal 
Record-seq with RNA-seq of E. coli from dif- 
ferent intestinal segments of mice fed a chow 
or starch diet. We found that ~46% of the fecal 
Record-seq DEGs were identified as differen- 
tially expressed in the same direction by fecal 
RNA-seq (Fig. 3A and table S7), highlighting 
concordance between the two technologies. 
However, ~54% of the genes detected by fecal 
Record-seq as up-regulated under the chow or 
starch diet were found to either be oppositely 
regulated or show no differences between the 
diets in fecal RNA-seq (Fig. 3A and table S7). 
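
The comparison above sorts each Record-seq DEG into one of three groups according to its behavior in fecal RNA-seq. A minimal sketch of that bookkeeping follows; the input layout, function name, and numbers are assumptions made for illustration and do not come from table S7.

```python
from typing import Dict, Set


def classify_concordance(
    record_lfc: Dict[str, float],
    rna_lfc: Dict[str, float],
    rna_significant: Set[str],
) -> Dict[str, float]:
    """Fractions of Record-seq DEGs that are concordant, opposite, or unchanged in RNA-seq.

    record_lfc: gene -> log2 fold change for Record-seq DEGs.
    rna_lfc: gene -> log2 fold change in fecal RNA-seq.
    rna_significant: genes called differentially expressed by RNA-seq.
    """
    counts = {"concordant": 0, "opposite": 0, "not_different": 0}
    for gene, lfc in record_lfc.items():
        if gene not in rna_significant:
            counts["not_different"] += 1
        elif (rna_lfc[gene] > 0) == (lfc > 0):
            counts["concordant"] += 1
        else:
            counts["opposite"] += 1
    total = len(record_lfc)
    if total == 0:
        return {key: 0.0 for key in counts}
    return {key: value / total for key, value in counts.items()}


# Hypothetical values for four Record-seq DEGs.
record = {"uxaC": 2.0, "gadA": 1.5, "gntK": 1.8, "narG": -1.2}
rna = {"uxaC": 0.2, "gadA": 1.1, "gntK": -0.9, "narG": -1.5}
fractions = classify_concordance(record, rna, rna_significant={"gadA", "gntK", "narG"})
```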

We considered that fecal Record-seq cap- 
tures transient events in proximal sections 
that are absent from fecal RNA-seq. After 
using RNA-seq to confirm that the transcrip- 
tional states of E. coli from different gut seg- 
ments or feces were distinct (fig. S5, A to D), 
we addressed the origin of the 54% of genes 
identified only by Record-seq in the chow con- 
dition (Fig. 3A and table S7). A substantial pro- 
portion (33%) of these genes were explained by 
segment-specific RNA-seq signals in the ce- 
cum and proximal colon, and a further 10% 
were explained by expression in the cecum 
and throughout the colon. Very few (1%) were 
expressed only in the distal colon. The remain- 
ing 56% identified with Record-seq but not 
RNA-seq were potentially differentially ex- 
pressed in niches not covered by the sampling 
performed here (such as the small intestine 
or colonic mucus) or reflected successive tran- 
scriptional events that fell below detection 
thresholds in RNA-seq. Results from the starch 
diet were comparable with these findings, al- 
though fewer genes were selectively expressed 
proximally, likely owing to transit effects 
arising from differences between the fiber 
content (25) of the diets (Fig. 3A and table S7). 
Rank-based hierarchical clustering performed 
on genes overexpressed in the cecum com- 
pared with other sections revealed better 
cecal RNA-seq clustering with Record-seq 
than with any other RNA-seq dataset, confirm- 
ing that proximal information is preserved 
with Record-seq (Fig. 3B and fig. S5E). 
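
Rank-based comparison puts spacer counts and RNA-seq read counts on a common scale despite their very different magnitudes. The sketch below shows one common way to rank-normalize a vector of counts (average ranks for ties, scaled by the number of genes); the exact procedure used in the study is not described in this section, so this is an assumption, and the counts are invented.

```python
from typing import List, Sequence


def rank_normalize(counts: Sequence[float]) -> List[float]:
    """Replace each value by its average rank divided by the number of values.

    Ties receive the mean of the ranks they span, so equal counts stay equal.
    The output lies in (0, 1], with 1.0 for the largest value.
    """
    n = len(counts)
    order = sorted(range(n), key=lambda i: counts[i])
    ranks = [0.0] * n
    i = 0
    while i < n:
        # Extend j to cover the run of tied values starting at position i.
        j = i
        while j + 1 < n and counts[order[j + 1]] == counts[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1  # ranks are 1-based
        for k in range(i, j + 1):
            ranks[order[k]] = average_rank
        i = j + 1
    return [r / n for r in ranks]


# Hypothetical counts for the same four genes in two assays with different depth.
rna_seq_counts = [1200, 30, 30, 560]
record_seq_counts = [45, 2, 2, 20]
rna_ranks = rank_normalize(rna_seq_counts)
record_ranks = rank_normalize(record_seq_counts)
```

In this toy case the two assays yield identical rank profiles even though the raw counts differ by more than an order of magnitude, which is the property that makes clustering across data types meaningful.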

To validate a subset of insights regarding 
proximal gut sections revealed by fecal Record-seq, 
we followed up on two findings using 
independent methodologies. First, uxaC (part 
of the hexuronate catabolism pathway) was 
distinctly up-regulated in fecal Record-seq and 
proximal gut RNA-seq under the starch diet 
(Fig. 3C), suggesting that hexuronates are preferred 
carbon sources for E. coli under these 
conditions. In a competitive colonization between 
WT E. coli and a mutant deficient in 
hexuronate catabolism (ΔuxaC), the ΔuxaC 
strain exhibited a significantly greater competitive 
disadvantage in starch-fed compared 
with chow-fed mice (Fig. 3D). Second, acid-stress 
response genes (gadABC and hdeAB) 
were selectively overexpressed in fecal Record-seq 
and proximal gut RNA-seq under the starch 
diet (fig. S5F). Accordingly, the pH of the 
cecum under the starch diet was significantly 
reduced compared with that under the chow 
diet (fig. S5G). Thus, fecal Record-seq reports 
transcriptional events throughout the length 
of the intestinal tract and contains information 
absent from fecal RNA-seq analysis. 

Fig. 3. Record-seq sentinel cells capture the milieu along the length of the intestine and preserve transient 
features of proximal large intestinal segments. Mice colonized with Record-seq sentinel cells were fed a 
chow or starch diet for 7 days. (A) Stacked bar plots showing fractions of Record-seq DEGs that were also 
detected with RNA-seq from feces or intestinal segments as up-regulated under a chow (430 genes) or starch 
diet (278 genes). Record-seq DEGs not identified with fecal RNA-seq (left bar) are categorized according to 
differential RNA expression in isolated gut sections (right bar): cecum or proximal colon (“proximal”), distal colon 
(“distal”), or both proximal and distal sections (“distal and proximal”). (B) Heatmap of cecum signature genes 
(213 genes overexpressed in the cecum) showing hierarchical clustering of rank-normalized RNA-seq and Record-seq 
data from the indicated intestinal sections from mice fed a chow diet. (C) Box plot showing E. coli 
rank-normalized counts of uxaC as determined with Record-seq or RNA-seq from feces, cecum, proximal colon, and 
distal colon corresponding to the indicated diets on day 7. Data in (A) to (C) are a combined analysis of n = 3 
biological replicates per group, each pooled from n = 3 individual mice for gut sections RNA-seq, and n = 5 
biological replicates per group for fecal RNA-seq and Record-seq. Count thresholds were 10* (Record-seq) and 
10° (RNA-seq). Outliers were excluded on the basis of modified z-score and relative deviation from the mean. 
(D) Competitive colonization experiment. Germ-free mice fed either a chow or starch diet were orally gavaged with 
a 1:1 mixture of WT E. coli MG1655 and E. coli MG1655 ΔuxaC. Competitive indices were calculated from fecal 
ratios of mutant to WT CFU [mean ± SEM, n = 5 independent biological replicates, P = 0.0022 likelihood ratio test 

Fig. 4. Record-seq provides a noninvasive assessment of DSS-induced intestinal inflammation. (A) Timeline of 
DSS colitis recording experiment. Germ-free mice were supplied with aTc in the drinking water; gavaged with 
E. coli BL21(DE3) sentinel cells; and received 1, 2, or 3% DSS or water as indicated. Fecal Record-seq 
sampling is indicated. (B) UMAP embedding of Record-seq data for control mice (blue) or mice treated with 
1% (salmon), 2% (red), or 3% (black) DSS. Dot sizes indicate successive time points. Mice receiving 3% DSS 
had to be euthanized on day 13. (C) Heatmap showing hierarchical clustering of Record-seq DEGs from control 
mice (blue) or mice treated with 1% (salmon) or 2% (red) DSS, day 19. z-score standardized gene-aligning 
spacer counts are shown. (D) Timeline of DSS colitis recording experiment. Germ-free mice were supplied with 
aTc in the drinking water, gavaged with E. coli MG1655 sentinel cells, and given 2% DSS or water as indicated. 
Fecal Record-seq sampling is indicated. (E) PCA-projected Record-seq data on days 7, 8, 10, 14, 17, and 20 
for control mice (blue) or mice treated with 2% DSS (red). K-medoids clusters are indicated with convex hulls. 
Dot sizes indicate successive time points. (F) Dot plot showing log2FC for Record-seq DEGs identified for 
control mice (blue) or mice treated with 2% DSS (red). Dot sizes increase with significance (Padj range, 
9.8 x 10°'° to 1.0). (G and H) STRING analysis of DEGs significantly (G) up-regulated or (H) down-regulated 
under DSS treatment compared with that of control. Node size increases with log2FC [(G) 0.5 to 4.7; 
(H) 0.5 to 2.9]. (B) and (C) correspond to the experiment in (A), with n = 3 independent biological 
replicates. (E) to (H) correspond to the experiment in (D), with n = 3 to 4 independent biological replicates of 
each condition. Count thresholds were 5 × 10³ in (B) and (C) and 10* in (E) to (H). Outliers were excluded 
on the basis of modified z-score and relative deviation from the mean. 
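
The figure legends state that outliers were excluded on the basis of a modified z-score and relative deviation from the mean, without giving parameters. As an illustration of the first criterion only, the sketch below uses the conventional median/MAD form of the modified z-score with the customary constant 0.6745 and cutoff 3.5. Both values are assumptions rather than the study's settings, and the relative-deviation criterion is not implemented.

```python
from statistics import median
from typing import List, Sequence


def modified_z_scores(values: Sequence[float]) -> List[float]:
    """Modified z-scores based on the median and the median absolute deviation (MAD)."""
    med = median(values)
    mad = median([abs(v - med) for v in values])
    if mad == 0:
        # All deviations collapse to zero; treat every value as non-outlying.
        return [0.0 for _ in values]
    return [0.6745 * (v - med) / mad for v in values]


def flag_outliers(values: Sequence[float], cutoff: float = 3.5) -> List[bool]:
    """True for each value whose absolute modified z-score exceeds the cutoff."""
    return [abs(z) > cutoff for z in modified_z_scores(values)]


# Hypothetical per-sample spacer counts; the last sample is far from the rest.
spacer_counts = [10200, 9800, 10050, 9900, 25000]
outlier_flags = flag_outliers(spacer_counts)
```

Using the median and MAD rather than the mean and standard deviation keeps a single extreme sample from masking itself, which matters with the small replicate numbers typical of these experiments.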


Record-seq provides a noninvasive assessment 
of intestinal inflammation 


Another potential application of sentinel cells 
is their use as noninvasive living diagnostics 
capable of reporting on gastrointestinal dis- 
eases (10). To test this concept, we used the 
dextran sulfate sodium (DSS)-induced colitis 
mouse model. After confirming that DSS had 
negligible direct impact on the E. coli tran- 
scriptome in vitro (fig. S6A), we performed 
Record-seq on sequentially collected fecal 
samples from mice treated with increasing 
concentrations of DSS (Fig. 4A). Record-seq 
not only distinguished treated versus control 
mice during DSS exposure and withdrawal 
but could also accurately report on the pheno- 
typic severity of the colitis model (Fig. 4, B 
and C, and fig. S6, B to E). 

Using a longer time course and only the 2% 
DSS condition, we repeated the in vivo lon- 
gitudinal recording experiment (Fig. 4D and 
fig. S6F) to characterize the intraluminal 
changes that result from DSS-induced inflam- 
mation throughout time. We first confirmed 
that transcriptional records could distinguish 
DSS-treated and control mice and identify 
DEGs throughout the duration of the experi- 
ment (Fig. 4, E and F; fig. S6, G and H; and 
tables S4 and S8). Next, we analyzed the DEGs 
with EcoCyc pathway enrichment (table S9), 
Gene Ontology (GO), Kyoto Encyclopedia of 
Genes and Genomes (KEGG) (table S10), and 
Search Tool for the Retrieval of Interacting 
Genes/Proteins (STRING) network tools dur- 
ing and after inflammation and determined 
that genes required for bacterial membrane 
integrity (pspG) and post-stress persister cell 
formation (mqsAR and hha) (26) were up- 
regulated (Fig. 4, F to H; fig. S61; and table S8). 
We also observed the up-regulation of chap- 
erones (spy and cpxP) and heat-shock proteins 
(ibpA and ibpB) that are induced by oxidative 
stress (27, 28). By contrast, genes and path- 
ways most prominently down-regulated in 
DSS-induced colitis included those required 
for the nitrate and dimethyl sulfoxide (DMSO) 
electron acceptor pathways (narGH and dmsAB), 
adaptation to low pH (gad), and the EDP (edd 
and eda). Down-regulation of the EDP under 
inflamed conditions likely reflects the mucus 
depletion that characterizes intestinal inflammation 
(29). These findings also suggest that 
E. coli shows decreased anaerobic respiration 
during inflammation but increased levels of 
envelope and oxidative stress, likely resulting 
from augmented luminal oxygenation (30). 
Several ribosomal protein operons were also 
down-regulated under DSS inflammatory conditions. 
Transcription of ribosomal proteins 
is regulated by ppGpp and DksA as part of 
the stringent response to nutrient limitation 
(31, 32). Thus, Record-seq sentinel cells can 
accurately report on DSS-induced colitis disease 
severity while simultaneously revealing 
multiple features of the intestinal inflammatory 
environment. 

Fig. 5. Record-seq illuminates both host-microbe and microbe-microbe interactions. (A) Timeline of 
longitudinal in vivo recording experiment on interaction of E. coli with B. theta in the gut. Germ-free mice were 
supplied with aTc in the drinking water and gavaged with E. coli MG1655 sentinel cells alone or together 
with B. theta. Fecal Record-seq sampling is indicated. (B) UMAP embedding of Record-seq data from E. coli in 
the presence (yellow) or absence (blue) of B. theta on the indicated days. Dot sizes indicate successive 
time points. (C) Heatmap showing hierarchical clustering of Record-seq data from E. coli in the presence 
(yellow) or absence (blue) of B. theta on the indicated days by using identified DEGs. z-score standardized 
gene-aligning spacer counts are shown. (D) Pathways and transcriptional and translational regulators 
identified as enriched (P < 0.05) by use of EcoCyc in E. coli in the presence (yellow) or absence (blue) of 
B. theta on days 2 to 27. Dot sizes indicate gene numbers significantly up-regulated for the respective 
pathway. (E) Schematic depicting nutrient cross-feeding relationship between E. coli and B. theta inferred 
by Record-seq. E. coli genes encoding transporters and enzymes are depicted with color codes reflecting 
their Record-seq-based log2FC of up-regulation in the presence versus absence of B. theta (0 to 5.0). (B) to 
(E) correspond to (A), with n = 4 independent biological replicates of each condition. Count threshold was 
5 x 10°. Outliers were excluded on the basis of modified z-score and relative deviation from the mean. 


Record-seq illuminates both host-microbe and 
microbe-microbe interactions 


To assess the performance of Record-seq in 
the presence of other intestinal microbes, we 
started by performing longitudinal in vivo re- 
cording experiments in mice in the presence of 
one other prototypical member of the human 
gut microbiota, Bacteroides thetaiotaomicron 
(B. theta). Distinct transcriptional archives were 
obtained from mice either monocolonized with 
E. coli or cocolonized with E. coli and B. theta 
(Fig. 5, A to C, and tables S11 and S12). The 
transcriptional records revealed alterations over 
a wide range of microbial functions, including 
a dramatic shift in inferred E. coli carbon 
source preferences (Fig. 5D; fig. S7, A and B; 
and tables S13 and S14). In the presence of 
B. theta, pathways for utilization of xylose, 
arabinose, sialic acid, amino sugars, citrate, 
rhamnose, maltose, and lactose were significantly 
up-regulated. Given that B. theta has a far 
richer content of polysaccharide utilization 
loci compared with that of E. coli (21, 33) and 
that mice were fed with chow containing complex 
plant cell wall carbohydrates (table S1) 
(25), our data support the idea that cross-feeding (Fig. 
5E) by B. theta liberates usable input nutrients 
(such as mono- and oligosaccharides) from 
otherwise unmetabolizable complex diet- or 
host-derived materials (such as plant pectins 
and mucus glycans) (34). Supporting the notion 
that nutrient cross-feeding is beneficial for 
E. coli, we observed a 3.4-fold increase in E. coli 
biomass in the presence of B. theta compared 
with that in monocolonized mice (fig. S7C). 

The presence of B. theta also led to down- 
regulation of E. coli genes involved in metab- 
olism of sugar alcohols, amino acids, fructose, 
nucleotides, and ethanolamine, which suggests 
that these secondary carbon sources were not 
required during bicolonization (Fig. 5D and 
fig. S7B). Selective expression of ribosomal 
proteins when E. coli cocolonized with B. theta 
(Fig. 5D and fig. S7A) is likely indicative of a 
stringent response on nonpreferred carbon 
sources when E. coli colonizes alone. These 
results, which we validated independently (fig. 
S7, D to G, and tables S12 to S14), demonstrate 
that Record-seq sentinel cells sense alterations 
in the intestinal environment in the presence 
of another taxon and report on the transcrip- 
tional adaptations that occur as a result. 

We next assessed the capacity of Record-seq 
to characterize gut function in the presence of 
a complex microbiota by gavaging sentinel 
cells into chow- or starch-fed mice colonized 
by a defined 12-member sDMDMm2 consortium 
(Fig. 6A) (35). Although colonization resistance 
caused most gavaged E. coli sentinel 
cells to rapidly pass through the gastrointes- 
tinal tract (fig. S8A), we detected transcrip- 
tional records as early as 6 hours after gavage 
(fig. S8B) and differential expression accord- 
ing to diet from 10 hours after gavage (Fig. 6B). 
Additionally, the recorded information was 
sufficient for stratifying the diet groups (fig. 
S8C) and reproduced in an independent ex- 
periment (fig. S8D). By 21 hours, Record-seq 
detected 220 DEGs in E. coli passing through 
sDMDMm2-colonized mice on a chow versus 
starch diet (Fig. 6C). STRING network, KEGG/ 
GO, and EcoCyc pathway enrichment analysis 
revealed diverse features of E. coli’s adaptation 
to these ecologically complex environments, 
including diet-dependent alterations in carbon 
metabolism (such as galactonate metabolism), 
anaerobic versus aerobic energy harvesting, 
and stress responses (Fig. 6D; fig. S8, E to G; 
and tables S15 to S18). Because none of the 
bacterial species from the sDMDMm2 micro- 
biota encode a complete DeLey-Doudoroff ga- 
lactonate degradation pathway, the finding of 
prominent up-regulation of genes involved in 

Fig. 6. Sentinel cells are deployable within a complex microbiota. (A) Timeline of complex microbiota 
recording experiment. Mice harboring a representative 12-member intestinal microbiota (sDMDMm2) 
were placed on either a chow or starch diet, supplied with aTc in the drinking water, and gavaged with a 
dose of 7 x 10° CFU aTc-pretreated E. coli MG1655 as indicated. Fecal Record-seq sampling is indicated. 
(B) Dot plot showing the log2FC for Record-seq DEGs corresponding to the indicated diet and time. Dot 
sizes increase with significance (Padj range, 1.9 x 107” to 1.0). Colored dots indicate a Padj < 0.1, and gray 
dots indicate nonsignificant differences. (C) Heatmap showing hierarchical clustering of Record-seq data 
at 21 hours from mice fed a chow (blue) or starch (green) diet based on identified DEGs. z-score 
standardized gene-aligning spacer counts are shown. (D) Pathways and transcriptional and translational 
regulators identified as enriched (P < 0.05) in mice fed chow (blue) or starch (green) by use of EcoCyc. 
Dot sizes increase with number of genes detected as significantly up-regulated for the respective pathway. 
(B) to (D) correspond to (A), with n = 6 independent biological replicates for each diet. Count threshold was 
5 x 10°. Outliers were excluded on the basis of modified z-score and relative deviation from the mean. 

galactonate metabolism in E. coli from chow- 
fed sDMDMm2 mice indicates that galacto- 
nate is likely available as a substrate for E. coli 
in this in vivo microbial consortium (36). Con- 
sistent with our earlier monocolonization re- 
sults (Fig. 2C), E. coli from starch-fed sDMDMm2 
mice overexpressed genes for utilization of 
mucus-derived sugar acids such as hexuronates 
and gluconate (fig. S8G), supporting the inter- 
pretation that host-derived sugar acids become 
more important even in the presence of an 
unmanipulated gnotobiotic microbiota as the 
dietary nutrient sources become less rich 
(22, 36). Thus, transcriptional recording senti- 
nel cells function in the context of an intestinal 
microbiota to reveal a portfolio of host-microbe 
and microbe-microbe interactions. 


Multiplexed Record-seq enables parallel 
transcriptional profiling of isogenic bacterial 
strains coinhabiting the mouse intestine 


Although genetic polymorphisms between taxa 
potentially allow RNA-seq to distinguish the 
transcriptional profiles of different taxa within 
an intestinal consortium, it is uninformative 
about adaptive transcriptional differences between 
isogenic strains of the same taxon that differ 
at a single genetic locus (37). We hypothesized 
that Record-seq could meet this need, allowing 
a mechanistic understanding of how a particular 
genetic lesion is functionally compensated 
within a taxon when two strains are 
coinhabiting the intestine. 

Fig. 7. Sentinel cells enable multiplexed transcriptional profiling of isogenic bacterial strains coinhabiting 
the mouse intestine. (A) Schematic for multiplexed in vivo recording with sentinel cells barcoded by 
their CRISPR array. Recording plasmids with either a leader-DR1 or leader-DR2 CRISPR array were 
transformed into WT or mutant E. coli cells and orally gavaged into mice at a 1:1 ratio. Spacers were 

To test this concept, we modified the Record- 
seq technology. First, we leveraged recent 
insights revealing that CRISPR spacer acqui- 
sition is aided by transcription-coupled repair 
(38) and introduced a constitutive promoter 
upstream of the CRISPR array within the re- 
cording plasmid, which improved recording 
efficiency. Second, we developed multiplexed 
transcriptional recording (Fig. 7A) by using 
two orthogonal CRISPR arrays with distinct 
leader and direct repeat (DR) sequences (17), 
referred to here as DR1 and DR2. After con- 
firming that the barcoded recording constructs 
facilitated labeling and computational stratification 
of the transcriptional archives of isogenic 
E. coli strains in vitro and in vivo (figs. S9 
and S10), we investigated whether multiplexed 
Record-seq could reveal the compensatory 
mechanism of an isogenic single-gene mutant 
in a competitive setting in vivo. We prioritized 
uxaC-deficient E. coli because Record-seq had 
revealed the importance of uxaC under the 
starch diet (Fig. 3, C and D). In two independent 
experiments, germ-free mice were cocolonized 
with WT and uxaC-deficient (ΔuxaC) E. coli 
MG1655 harboring barcoded recording plasmids, 
such that WT-DR2 was in competition 
with ΔuxaC-DR1 (group 1) or, conversely, 
WT-DR1 competed with ΔuxaC-DR2 (group 2) (fig. 
S11A). Each isogenic strain revealed robust 
transcriptional recording activity in vivo, and 
differences in the transcriptional signatures 
were driven by genotype (Fig. 7B and figs. S11B 
and S12), demonstrating multiplexed transcriptional 
recording of two isogenic E. coli 
strains inside the same mouse intestine. 

Analysis of DEGs (table S19) and pathways 
(tables S20 and S21) revealed decreased expression 
by the uxaC-deficient mutant of other 
hexuronate utilization genes such as uxaA, 
uxaB, uxuA, and uxuB in addition to the expected 
lack of uxaC (Fig. 7C and fig. S12C). 
Because hexuronate utilization genes are induced 
by galacturonate and/or glucuronate 
(39), the WT strain likely displaces the ΔuxaC 
strain from niches where these sugar acids are 
available. Furthermore, the ΔuxaC strain appears 
to compensate for this deficiency through 
the up-regulation of serine/threonine- and 
maltose-utilizing gene products. Thus, Record-seq 
has the distinctive capacity to enable 
multiplexed transcriptional profiling of two 
isogenic strains of E. coli in the same mouse 
gut. This approach reveals compensatory mechanisms 
in response to intraspecies competition 
in which RNA-seq-based experiments are 
uninformative. 

deficient barcoded E. coli sentinel 
cells. Two different pairings 
between genotype (WT or ΔuxaC) 
and CRISPR array (DR1 or DR2) 
ΔuxaC-DR1 (blue) and WT-DR2 
DR2 (green) and WT-DR1 
standardized gene-aligning spacer counts are shown. (C) Pathways and transcriptional and translational regulators identified as enriched (P < 0.05) by use of EcoCyc 
on the basis of high-confidence DEGs between ΔuxaC (red) and WT (blue) E. coli. Dot sizes indicate gene number significantly up-regulated for the respective 
pathway. (B) and (C) correspond to fig. S11A, with n = 5 independent biological replicates of each condition and days 7 to 10. Count threshold was 10*. Outliers were 
excluded on the basis of modified z-score and relative deviation from the mean. 

Conclusions 

We demonstrated that transcriptional recording 
sentinel cells, which use FsRT-Cas1-Cas2 to 
integrate RNA-derived spacers from the E. coli 
transcriptome into plasmid DNA-encoded 
CRISPR arrays, are capable of recording complex 
and dynamic transcriptional changes 
during the adaptation of E. coli throughout 
time, transit, and perturbation of the mammalian 
intestinal tract. This scalable, noninvasive 
system for assessing intestinal function 
in vivo archives characteristic microbial sig- 
natures of physiological or pathological states. 
Transcriptome-scale recordings elucidate mi- 
crobial responses to alterations in the intralu- 
minal environment across nutrition, intestinal 
inflammation, and microbe-microbe interactions. 
We have illustrated how the tool reveals 
carbon source preferences without 
confounding manipulations and validated 
findings of intraluminal microbial adaptation 
under different dietary conditions. 

Record-seq offers multiple advantages com- 
pared with contemporary techniques. First, 
unlike conventional cell-based biosensors, 
Record-seq does not require a specific bio- 
sensor for every biomolecule of interest and 
can report on a wide range of complex biolog- 
ical features, serving as an unbiased discov- 
ery tool. Second, compared with conventional 
omics-based technologies run on fecal sam- 
ples, Record-seq integrates information on gut 
function along the length of the intestine, 
which is particularly valuable for studying the 
proximal large intestinal environment that 
has been largely refractory to detailed studies 
because of its inaccessible location. Third, 
multiplexed Record-seq reveals diverse in situ 
microbe-microbe interactions, even between 
variants of a single microbial species within 
the same animal over time, which cannot be 
readily scaled or implemented in high through- 
put with conventional methods. 

We envision that transcriptional recording 
sentinel cells and the Record-seq principle may 
enable an understanding of intestinal and 
microbiota physiology under different dietary 
conditions, disease contexts, or constraints in 
humans where longitudinal sample collection 
based on surgical, endoscopic, fecal, ingestible 
device, or post mortem methods is neither 
feasible nor sufficient. 


Materials and methods 
Bacterial strains 


Bacterial strains used in this study were 
Escherichia coli strains MG1655 (ATCC no. 
700926) and BL21(DE3) Gold (Agilent no. 
230132) and Bacteroides thetaiotaomicron 
strain VPI-5482 (ATCC no. 29148). E. coli 
MG1655 StrR, E. coli MG1655 StrR NalR, E. coli 
MG1655 StrR ΔidnK/ΔgntK, and E. coli MG1655 
StrR ΔuxaC were provided by T. Conway, and 
the KanR marker of the ΔuxaC strain was removed 
using pCP20 recombination as reported 
previously (40), yielding E. coli MG1655 StrR 
ΔuxaC ΔKanR. All E. coli strains used in this 
study are reported in table S22. MG1655 isolates 
have been sequenced and their genomic 
sequences are available in the NCBI Assembly 
database (PRJNA807125). NCBI Reference Sequences 
U00096.3 and NC_012947.1 were used 
for MG1655 and BL21(DE3) Gold, respectively. 
The stable defined moderately diverse mouse 
microbiota 2 (sDMDMm2) has been described 
previously (36, 47) and is available through 
the Deutsche Sammlung für Mikroorganismen 
und Zellkulturen (DSMZ). The constituting 
taxa were originally isolated from the 
mouse intestine and comprise Bacteroides I48, 
Blautia YL58, Akkermansia YL44, Bacteroidales 
YL27, Ruminococcaceae KB18, Lactobacillus I49, 
Lachnospiraceae YL32, Erysipelotrichaceae I46, 
Enterococcus KB1, Flavonifractor YL31, 
Parasutterella YL45, and Bifidobacterium YL2 
(table S23). 


Mice 

All mouse experiments were performed in ac- 
cordance with Swiss federal and cantonal reg- 
ulations under permit numbers BE43/16, BE44/ 
18 and BE107/20. Germ-free C57BL/6(J) mice 
were born and housed in flexible-film isola- 
tors in the Clean Mouse Facility, University 
of Bern, Switzerland. Unless noted otherwise, 
mice received a vitamin-fortified rodent chow 
diet (Kliba Nafag 3307) sterilized by autoclav- 
ing for 20 min at 132°C and water ad libitum. 
Age and sex-matched mice were used at 6 to 
15 weeks of age (mostly 8 to 12 weeks). Mice 
were constantly and independently confirmed 
to be germ-free within the breeding isolators 
by culture-dependent methods (liquid cultures 
in brain-heart infusion (BHI) broth (Thermo 
Fisher Scientific) aerobically at 37°C at 180 rpm 
and anaerobically in an anaerobic cabinet 
(Meintrup DWS) containing 80% N2, 10% H2, 
and 10% CO2 at 37°C without shaking) and 
culture-independent methods (i.e., micro- 
scopic examination of fecal smears stained with 
the DNA dye SYTOX green (Thermo Fisher 
Scientific)). 

During experiments, the absence of bacteria 
other than E. coli (and B. theta in experiments 
related to Fig. 5 and fig. S7) was constantly 
confirmed by culturing of fecal suspensions 
on lysogeny broth (LB) agar aerobically and on 
BHI agar with 5% defibrinated sheep blood 
anaerobically. The microbial biomass per gram 
feces was determined by weighing fecal pellets, 
homogenizing them in 1 ml of PBS using a 
Retsch MM400 tissue lyser at 30 Hz for 3 min 
and streaking serial dilutions in PBS onto 
LB agar. 

One day before gavage, drinking water of 
the mice was exchanged for water containing 
30 µg/ml of anhydrotetracycline (aTc) 
(Adipogen) and 100 µg/ml of kanamycin sulfate 
(MP biochemicals), which was prepared 
by diluting stock solutions of 2 mg/ml of aTc 
in 95% ethanol and 100 µg/ml of kanamycin 
sulfate in aqua bidest into sterile tap water. 


Plasmid transformation 


For plasmid transformation, E. coli BL21(DE3) 
Gold (Agilent Technologies) and E. coli 
MG1655 (ATCC no. 700926) were made chemically 
competent using the Mix & Go E. coli 
Transformation Kit & Buffer Set (Zymo Research). 
For this, strains were streaked to 
single colonies on LB (Difco) agar (Huberlab) 
plates without antibiotics followed by growth 
overnight at 37°C. A single colony of E. coli was 
inoculated into 50 ml of ZymoBroth (Zymo 
Research) and grown at 19°C, 220 rpm in an 
orbital shaker (New Brunswick Innova 40R) 
to an optical density of OD600 = 0.45. Subsequently, 
cells were made competent following 
the manufacturer's protocol, dispensed to 
aliquots of 25 µl, flash-frozen in liquid nitrogen, 
and stored at -80°C. 

Transformation with the recording plasmid 
pFS_0453 (Addgene #117006) was performed 
by adding ~60 ng of plasmid DNA to 25 µl of 
competent cells, followed by heat shock (42°C, 
30 s), recovery in 120 µl of S.O.C medium at 
37°C, 900 rpm, 30 min and spreading on LB 
agar plates containing 50 µg/ml of kanamycin 
sulfate (Biochemica). Glycerol stocks were 
created by growth of transformants in LB with 
50 µg/ml of kanamycin sulfate at 37°C, 180 rpm 
in bacterial culture tubes followed by mixing 
of 500 µl of saturated culture with 500 µl of 
sterile filtered 50% (v/v) glycerol and freezing 
at -80°C for long-term storage. 


Oral gavage 


Unless stated otherwise, a saturating gavage 
dose of 1 x 10° colony forming units (CFU) of 
E. coli was used to avoid confounding the 
biological signal reported by our sentinel cells 
with an initial expansion in the gastrointesti- 
nal tract (42). To maintain the recording plas- 
mid and ensure functional stability of the 
sentinel cells, we added kanamycin sulfate to 
the drinking water and confirmed that the 
transformed cells colonized at (7.9 ± 2.6) x 
10° CFU/g feces, comparably to what has been 
reported for the parental strain (42). For oral 
gavage, E. coli MG1655 or BL21(DE3) each 
transformed with pFS_0453 was inoculated 
from freshly grown colonies and cultured 
overnight under aerobic conditions in LB containing 
50 µg/ml of kanamycin sulfate at 37°C, 
180 rpm. Upon saturation, cultures were cen- 
trifuged at 3480g for 10 min at room temper- 
ature and washed twice with the equivalent 
volume as the LB culture in sterile phosphate- 
buffered saline (PBS) (8 g per liter of NaCl, 
0.2 g per liter of KCl, 1.44 g per liter of 
Na2HPO4, and 0.24 g per liter of KH2PO4, 
all from Sigma-Aldrich). The required dose of 
bacteria was resuspended in PBS (1 x 10° CFU 
per 500 µl). The bacterial suspension was 
orally gavaged directly into the mouse duode- 
num with a 12 gauge straight stainless-steel 
needle (Provet AG) attached to a 2-ml syringe. 
The culture volume was chosen according to 
the number of mice to be gavaged with the 
equivalent of 3 ml of saturated culture being 
gavaged into each mouse. Culture vessels were 
at least twice as large as the culture volume to 
ensure proper aeration. For example, to ga- 
vage n = 15 mice, E. coli was grown in 100 ml of 
LB broth in a 250-ml bottle, washed twice with 
100 ml of PBS, and resuspended in 16.6 ml of 
PBS from which 500 µl was gavaged into each 
mouse. Gavage doses and absence of contam- 
ination were confirmed by streaking serially 
diluted suspensions onto LB agar. 
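
The worked example above follows from simple proportionality. A minimal Python sketch of that arithmetic, assuming (as stated in the text) 3 ml of saturated culture and a 500 µl gavage volume per mouse; the function name `gavage_plan` and its parameters are illustrative and not part of the original protocol:

```python
def gavage_plan(culture_ml, culture_per_mouse_ml=3.0, gavage_ul=500.0):
    """Return (number of doses, PBS resuspension volume in ml).

    culture_ml           -- volume of saturated LB culture that was washed
    culture_per_mouse_ml -- culture equivalent delivered to each mouse (3 ml in the text)
    gavage_ul            -- volume gavaged into each mouse (500 µl in the text)
    """
    doses = culture_ml / culture_per_mouse_ml
    resuspension_ml = doses * gavage_ul / 1000.0
    return doses, resuspension_ml


# 100 ml of culture, as in the n = 15 example
doses, pbs_ml = gavage_plan(100.0)
print(f"{doses:.1f} doses, resuspend in {pbs_ml:.2f} ml PBS")
```

With 100 ml of culture this gives about 33 doses and a resuspension volume of about 16.67 ml, which matches the 16.6 ml quoted above and leaves a comfortable excess over the 15 mice to be gavaged.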


Isolation of RNA and RNA-seq 


Fecal pellets were homogenized in 1 ml of PBS 
using a Retsch MM400 tissue lyser at 30 Hz 
for 3 min. Large particles were pelleted by 
centrifugation at 200g for 2 min at room 
temperature. Bacteria in the supernatant were 
pelleted by centrifugation at 6800g for 3 min 
at room temperature and lysed in 100 µl of 
RNA extraction solution containing 95% (v/v) 
formamide (VWR), 0.025% (w/v) SDS (Bio- 
Rad), 18 mM EDTA (Merck Millipore), and 
1% 2-mercaptoethanol (Merck Millipore) (43) 
at 95°C, 1200 rpm shaking for 7 min. Fol- 
lowing centrifugation at 16,000g for 5 min at 
room temperature, the RNA in the super- 
natant was purified with the RNeasy clean-up 
kit (QIAGEN) following the manufacturer’s 
instructions, including the optional DNase 
treatment (15 min at room temperature). RNA 
was frozen at -80°C for storage and submitted 
to the Next Generation Sequencing (NGS) 
Platform Bern for ribosomal RNA (rRNA) de- 
pletion using the RiboMinus Transcriptome 
Isolation Kit, bacteria (Invitrogen), followed 
by library preparation using the Illumina 
TruSeq Stranded Total RNA Kit (Illumina) 
and sequencing on an Illumina NovaSeq plat- 
form using the NovaSeq 6000 SP Reagent Kit 
(100 cycles). 


Isolation of plasmid DNA from feces and 
intestinal contents 


For E. coli monocolonized mice, 50-100 mg of 
fecal material or intestinal contents were collected 
and frozen at -20°C. After thawing, 
feces or intestinal contents were homogenized 
in 1 ml of PBS using a Retsch MM400 tissue 
lyser at 30 Hz for 3 min. Large particles were 
pelleted by centrifugation at 200g for 2 min 
at room temperature. Bacteria in the super- 
natant were pelleted by centrifugation at 
6800g for 3 min at room temperature and 
plasmid DNA was isolated using the QIAprep 
spin Miniprep kit (QIAGEN) with the following 
modifications: E. coli cells were resuspended 
in 500 µl of buffer P1 and then lysed by 
the addition of 500 µl of buffer P2 for 5 min at 
room temperature. The reaction was neutralized 
by the addition of 700 µl of buffer N3 and 
centrifuged at 18,000g for 10 min at room 
temperature to pellet debris. Supernatant was 
passed through a spin column on a vacuum 
manifold (QIAGEN), upon which the column 
was washed with 500 µl of buffer PB followed 
by 700 µl of buffer PE. Residual wash buffer 
was removed by centrifugation at 18,000g for 
1 min at room temperature. For cecal contents 
of monocolonized mice (~200 mg), buffer volumes 
were increased to 1000 µl of buffer P1, 
1000 µl of buffer P2, and 1400 µl of buffer N3, 
whereas volumes of buffer PB and PE remained 
the same. Plasmid DNA was eluted from the 
column by the addition of 50 µl of pre-warmed 
buffer EB (55°C) followed by incubation at 55°C 
at 850 rpm for 3 min followed by centrifugation 
at 10,000g for 1 min. A total of three elution 
steps with 50 µl were performed, yielding 150 µl 
of eluate. DNA from this eluate was precipitated 
by the addition of 15 µl of 3 M sodium 
acetate solution (Sigma-Aldrich) and 150 µl of 
2-propanol (Merck-Millipore) followed by incubation 
at -20°C for 20 min and centrifugation 
at 20,000g for 20 min at room temperature. 
Supernatant was carefully removed 
without disturbing the pellet, which was washed 
with 500 µl of 80% (v/v) ethanol and centrifuged 
at 20,000g for 15 min at room temperature. 
Ethanol was removed completely and 
the pellet was briefly dried (55°C, 30 s) before 
resuspension in 15 µl of buffer EB and transfer 
to 96-well PCR plates for storage at -20°C or 
immediate use in SENECA. 


Quantification of isolated plasmid DNA by 
droplet digital PCR (ddPCR) 


For quantification by ddPCR, plasmid DNA was 
first diluted 100,000-fold in ddPCR dilution 
buffer. ddPCR dilution buffer consisted of 
2 ng/µl of sheared salmon sperm DNA (Sigma 
Aldrich) and 0.05% Pluronic F-68 (Invitrogen) 
in UltraPure DNase/RNase-Free distilled water 
(Thermo Fisher Scientific). Dilution steps were 
performed in twin.tec PCR plate 96 LoBind 
(Eppendorf) as follows: dilution 1: 1 µl of plasmid 
DNA, 49 µl of ddPCR dilution buffer; 
followed by dilution 2: 1 µl of dilution 1, 49 µl 
of ddPCR dilution buffer; and finally dilution 
3: 1 µl of dilution 2, 39 µl of ddPCR dilution 
buffer. Primer-probe assays targeting FsRT-Cas1 
were prepared by mixing 180 µl of 
FS_2814 (100 µM), 180 µl of FS_2815 (100 µM), 
and 50 µl of FS_2816 (100 µM) with 590 µl of 
TE buffer (Sigma Aldrich) (table S24). Aliquots 
of 100 µl of primer-probe assay were prepared 
and stored in 1.5-ml amber tubes (Eppendorf) 
at -20°C. ddPCR was performed by mixing 
4.5 µl of dilution 3 template, 1.1 µl of primer-probe 
assay, 0.25 µl of FastDigest XhoI (Thermo 
Fisher Scientific), 11 µl of ddPCR Supermix for 
probes (no dUTP) (Bio-Rad), and 5.4 µl of 
UltraPure DNase/RNase-Free distilled water 
per reaction. PCR reactions were dispensed 
into droplets using the QX100 droplet generator 
(Bio-Rad) according to the manufacturer’s 
protocol. PCR amplification was performed in 
an Eppendorf Mastercycler Gradient (95°C for 
10 min, followed by 42 cycles of 95°C for 30 s, 
57.1°C for 60 s, 72°C for a 15-s final extension, 
and a final 98°C 10-min step) with a ramp rate 
for all steps of 2°C/s as specified by the manufacturer. 
Readouts were measured using a 
QX100 droplet reader (Bio-Rad) with a cut-off 
for positive droplets manually set to 3500. 
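
The volumes quoted above can be checked with a few lines of arithmetic. The sketch below is only a bookkeeping aid, not part of the published workflow: it recomputes the overall dilution factor from the three serial steps, the oligonucleotide concentrations in the primer-probe assay mix, and the total reaction volume. All variable and function names are illustrative.

```python
from math import prod

# (sample µl, diluent µl) for the three serial dilution steps given in the text
DILUTION_STEPS = [(1, 49), (1, 49), (1, 39)]

def fold_dilution(steps):
    """Overall fold dilution of a serial dilution series."""
    return prod((sample + diluent) / sample for sample, diluent in steps)

# Primer-probe assay mix: µl of each 100 µM oligo stock, topped up with TE buffer
ASSAY_OLIGOS_UL = {"FS_2814": 180, "FS_2815": 180, "FS_2816": 50}
TE_BUFFER_UL = 590
STOCK_UM = 100

def assay_concentrations(oligos_ul, te_ul, stock_um):
    """Concentration (µM) of each oligo in the finished assay mix."""
    total_ul = sum(oligos_ul.values()) + te_ul
    return {name: stock_um * vol / total_ul for name, vol in oligos_ul.items()}

# Per-reaction components in µl, as listed in the text
REACTION_UL = {
    "template (dilution 3)": 4.5,
    "primer-probe assay": 1.1,
    "FastDigest XhoI": 0.25,
    "ddPCR Supermix": 11.0,
    "water": 5.4,
}

fold = fold_dilution(DILUTION_STEPS)
concentrations = assay_concentrations(ASSAY_OLIGOS_UL, TE_BUFFER_UL, STOCK_UM)
reaction_volume = sum(REACTION_UL.values())

print(f"fold dilution: {fold:,.0f}")
print({name: round(c, 2) for name, c in concentrations.items()})
print(f"reaction volume: {reaction_volume:.2f} µl")
```

The three steps (50 x 50 x 40) reproduce the stated 100,000-fold dilution, the assay mix totals 1000 µl with the oligonucleotides at 18, 18, and 5 µM, and each reaction comes to 22.25 µl before droplet generation.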


Selective amplification of expanded CRISPR 
arrays (SENECA) 


The SENECA library preparation method has 
been extensively described before (18). All DNA 
oligonucleotides were ordered from IDT (table 
S25); FastDigest FaqI was obtained from Thermo 
Fisher Scientific, and T7 DNA Ligase and NEBNext 
High Fidelity PCR Master Mix, 2X were both 
obtained from New England Biolabs. Due to the 
low concentration of plasmid DNA extracted from 
fecal pellets and residual genomic DNA, input 
DNA was not normalized but instead 3.75 to 7.5 µl 
of purified plasmid DNA was used for SENECA 
adapter ligation; volumes are stated in the subsections 
below corresponding to the respective 
experiments along with the specific annealed 
oligonucleotides (carrying library barcodes) 
for adapter ligation. SENECA first round PCR 
was performed with 23 cycles instead of 22 
cycles and otherwise as described before (8). 
After second-round PCR, 3 µl of each sample 
were mixed with 17 µl of UltraPure DNase/ 
RNase-free distilled water (Thermo Fisher 
Scientific) and loaded on an E-Gel 48 Agarose 
Gel, 2% alongside 150 ng of GeneRuler low-range 
DNA ladder (Thermo Fisher Scientific) in 20 µl 
for gel-based quantification using the software 
Bio-Rad Image Lab version 6.0.1. Samples 
from an experiment were assigned to five bins 
based on their DNA concentrations and pooled 
according to these bins. Each sub-pool was 
purified separately by PCR purification and gel 
extraction from E-Gel EX 2% (Thermo Fisher 
Scientific) agarose gels as described previously, 
individually quantified by quantitative 
PCR (qPCR) using the KAPA Library Quantification 
Kit for Illumina Platforms (Roche), 
pooled to achieve equal sequencing depth according 
to the number of samples in each sub-pool, 
and sequenced on an Illumina NextSeq 
500/550 platform as described before. As Record-seq 
is a population-based measurement requiring 
many cells to reconstruct a cellular 
history, and in vivo experiments present a 
material-limiting environment, we addressed 
the technical inputs and outputs of our workflow 
(fig. S1, A and B). Record-seq usually used 
an input of approximately (4.7 ± 0.7) x 10° cells 
per sample (3 to 4 fecal pellets of ~27 mg each 
per mouse with a biomass of 5.6 x 10° sentinel 
cells per gram). After 24 hours of in vivo recording, 
the Record-seq output was (3.6 ± 0.6) x 
10° spacers, which increased by (1.0 ± 0.5) x 10* 
spacers per day. After 7 days of recording, we 
found (1.2 ± 0.1) x 10° spacers, which aligned 
to 91 ± 1% of the 4419 transcripts in the E. coli 
transcriptome. 
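
The final pooling step described in this section weights each sub-pool by the number of samples it contains, so that every sample receives a similar share of reads. A minimal sketch of that calculation, assuming hypothetical sub-pool sizes and qPCR concentrations; the function name, the bin labels, and all numbers below are illustrative and are not taken from the study:

```python
def pooling_volumes(subpools, total_fmol):
    """Volume (µl) of each sub-pool to combine into the final library.

    subpools   -- dict: name -> (number of samples, qPCR concentration in nM)
    total_fmol -- total amount of library wanted in the final pool

    Each sub-pool contributes material in proportion to its sample count.
    Note that 1 nM equals 1 fmol/µl, so fmol / nM gives µl directly.
    """
    n_total = sum(n for n, _ in subpools.values())
    volumes = {}
    for name, (n_samples, conc_nm) in subpools.items():
        share_fmol = total_fmol * n_samples / n_total
        volumes[name] = share_fmol / conc_nm
    return volumes


# Hypothetical example: three sub-pools with 4, 6, and 10 samples
example = {"bin1": (4, 2.0), "bin2": (6, 4.0), "bin3": (10, 10.0)}
volumes = pooling_volumes(example, total_fmol=100.0)
print(volumes)
```

In this toy case the three sub-pools contribute 20, 30, and 50 fmol, which corresponds to 10, 7.5, and 5 µl at the assumed concentrations.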


Primary analysis of data 


The single-end sequencing readout from Record- 
seq and RNA-seq was processed and analyzed 
using a two-stage computational pipeline as 
described before (18). The first step in the pri- 
mary analysis pipeline involved pre-processing 
of sequencing reads using FastQC v0.11.4 
(http://www.bioinformatics.babraham.ac.uk/projects/fastqc) 
and trimmomatic v0.35 (44). For Record-seq, 
FASTQ files containing sequencing results 
were converted to FASTA files using the FASTX-toolkit 
v0.0.14 (http://hannonlab.cshl.edu/fastx_toolkit). 
Reads without the library barcode were 
excluded and acquired unique spacer sequences 
were identified from the remaining reads using 
a dedicated python script which incorporates 
fuzzy string matching (spacerExtractor.py). Iden- 
tified unique spacer sequences for Record-seq 
and sequencing reads for RNA-seq were aligned 
to merged references that contain E. coli ge- 
nome and plasmid sequences and annotation 
using Bowtie 2 (45). The following E. coli reference 
genomes were used: E. coli str. K-12 
substr. MG1655 (GenBank U00096.3) and 
E. coli BL21-Gold(DE3) pLysS AG (Ensembl 
ASM2366v1). Alignments were then processed 
and assigned to either the reference E. coli 
genome or plasmid using Samtools v1.3 (46), 
and for Record-seq, duplicate alignments 
were removed using a custom python script 
(SErmdup.py). This two-step stringent filter 
for duplicate spacers was incorporated to re- 
move multiple instances of the same spacer 
arising due to amplification or plasmid repli- 
cation, and thus obtain a conservative esti- 
mate of spacer diversity. Count matrices were 
generated by quantifying alignments using 
featureCounts from the Subread package (47). 
These count matrices contain transcript 
counts for each RNA-seq sample and transcript- 
aligning spacer counts for each Record-seq 
sample. All the steps of this primary analysis 
pipeline were implemented in Snakemake 
(48) workflows. The Snakemake workflows 
for each experiment described in this man- 
uscript, python scripts, reference genomes, 
and ancillary scripts for primary analysis 
are available on GitHub (Data and materials 
availability). 
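
To make the barcode filtering and unique-spacer steps concrete, the following is a toy Python sketch. It is not the authors' spacerExtractor.py, whose implementation is not described here; it simply assumes that a read starts with the library barcode (tolerating one mismatch as a stand-in for fuzzy matching) and that a spacer sits between two copies of a direct repeat. The sequences, the mismatch tolerance, and all function names are hypothetical.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def has_barcode(read, barcode, max_mismatches=1):
    """True if the read starts with the barcode, allowing a few mismatches."""
    prefix = read[:len(barcode)]
    return len(prefix) == len(barcode) and hamming(prefix, barcode) <= max_mismatches


def extract_spacer(read, repeat):
    """Return the sequence between the first two repeat copies, or None."""
    first = read.find(repeat)
    if first == -1:
        return None
    start = first + len(repeat)
    second = read.find(repeat, start)
    if second == -1:
        return None
    return read[start:second] or None


def unique_spacers(reads, barcode, repeat):
    """Spacers from barcode-containing reads, with duplicates collapsed."""
    seen = set()
    for read in reads:
        if not has_barcode(read, barcode):
            continue
        spacer = extract_spacer(read, repeat)
        if spacer:
            seen.add(spacer)
    return sorted(seen)


# Toy sequences (hypothetical, far shorter than real barcodes and repeats)
BARCODE, REPEAT = "ACGT", "GGCC"
reads = [
    "ACGT" + "GGCC" + "TTTAAA" + "GGCC",  # exact barcode
    "ACGA" + "GGCC" + "TTTAAA" + "GGCC",  # one barcode mismatch, duplicate spacer
    "TTTT" + "GGCC" + "CCCAAA" + "GGCC",  # wrong barcode, discarded
    "ACGT" + "GGCC" + "CATCAT" + "GGCC",  # second distinct spacer
]
spacers = unique_spacers(reads, BARCODE, REPEAT)
print(spacers)
```

Collapsing spacers into a set mirrors the conservative treatment of duplicates described above, in which repeated instances of the same spacer are counted once.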


Secondary analysis of data 


Secondary analysis was performed on gener- 
ated count matrices by building upon the 
previously described recoRdseq package implemented 
in R (18). This broadly involved 
unsupervised clustering of samples and identification 
of classifier genes based on differential 
expression analysis. Complete analyses for 
all experiments described in this manuscript 
are available as R notebooks on GitHub (Data 
and materials availability). In general, the first 
step involved filtering count matrices by ex- 
cluding outlier samples with low cumulative 
counts ($C_s = \sum_t x_{t,s}$; $x$ = count, $t$ = transcript, 
$s$ = sample) among replicates using an empirically 
adjusted absolute cumulative counts 
threshold, as well as a combined threshold 
for outliers as described below: 

(i) include replicate if modified Z-score $Z_i > -3$ 
($Z_i = 0.6745\,(y_i - \tilde{y})/\mathrm{MAD}$, where $y_i$ = cumulative 
count for replicate $i$, $\tilde{y}$ = median cumulative 
count for all replicates, and $\mathrm{MAD}$ = median absolute 
deviation of the $y_i$ from $\tilde{y}$), else 

(ii) if $Z_i < -3$, include replicate if relative 
deviation from the mean of replicates $D_i < 0.25$ 
($D_i = |y_i - \bar{y}|/\bar{y}$, where $y_i$ = cumulative count for 
replicate $i$, and $\bar{y}$ = mean cumulative count for 
all replicates). 
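
The two-stage replicate filter can be sketched in a few lines of Python. This is an illustration rather than the recoRdseq implementation: it assumes the conventional modified Z-score, 0.6745 (y_i - median) / MAD, with MAD the median absolute deviation, and it takes the relative deviation as |y_i - mean| / mean. The function name and the example counts are hypothetical.

```python
from statistics import mean, median


def keep_replicates(cumulative_counts, z_min=-3.0, max_rel_dev=0.25):
    """Return one boolean per replicate: True if the replicate is retained.

    Rule (i): keep if the modified Z-score is above z_min.
    Rule (ii): otherwise keep only if the relative deviation from the
    replicate mean is below max_rel_dev.
    Assumes positive cumulative counts (the mean must be non-zero).
    """
    y = [float(v) for v in cumulative_counts]
    y_median = median(y)
    mad = median([abs(v - y_median) for v in y])  # median absolute deviation
    y_mean = mean(y)

    keep = []
    for v in y:
        # If all replicates are identical, MAD is 0 and nothing is an outlier
        z = 0.6745 * (v - y_median) / mad if mad > 0 else 0.0
        rel_dev = abs(v - y_mean) / y_mean
        keep.append(z > z_min or rel_dev < max_rel_dev)
    return keep


# One replicate with a far lower cumulative count is dropped
dropped = keep_replicates([1000, 1100, 950, 1050, 100])
print(dropped)

# Tight replicates: a low Z-score alone does not exclude the fifth replicate,
# because its relative deviation from the mean is small
rescued = keep_replicates([1000, 1001, 1002, 1003, 900])
print(rescued)
```

The second example shows why rule (ii) is useful: when replicates agree very closely, the MAD becomes tiny and the modified Z-score flags even modest differences, so the relative deviation acts as a safeguard against discarding usable samples.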


Lowly abundant (or recorded) transcripts, 
defined as transcripts having a low cumulative 
count across samples ($C_t = \sum_s x_{t,s}$; $x$ = count, $t$ = 
transcript, $s$ = sample), were also excluded 
from the analysis. Further, the first day after 
gavage (Day 1) generally yielded low spacer 
counts and noisy data compared to later days 
in multi-day experiments, hence Day 1 was ex- 
cluded from all subsequent analyses. The 
count matrices were then normalized and 
transformed using the variance-stabilizing
transformation (VST) implemented in the DESeq2
package (49), which rendered the data approx- 
imately homoscedastic. For dimensionality re- 
duction and unsupervised cluster discovery, 
principal component analysis (PCA) using the 
R base stats package and Uniform Manifold 
Approximation and Projection (UMAP) (50) 
using the umap package implemented in R 
were performed on the vst-transformed count 
matrices. UMAP parameters and hyperparam- 
eters were tuned for each dataset to achieve 
optimal separation between experimental groups. 
k-medoids clustering was used to detect clus- 
ters in PCA-projected data. Fixed random num- 
ber seeds were used to ensure reproducibility 
of clustering algorithms. PCA and UMAP re- 
sults as well as other illustrative plots were 
plotted using the ggplot2 package in R. Dif- 
ferential expression analysis was performed 
using the Wald test (pairwise comparisons) 
or likelihood ratio test (multiple-group com- 
parisons) implemented in DESeq2 and the 
quasi-likelihood F-test implemented in the 
edgeR package in R. DEGs were defined as
the intersection of significant genes ($P_{adj}$ < 0.1,
where $P_{adj}$ = Benjamini-Hochberg adjusted
P-value detected by these two tools). For
time-course datasets, DEGs were independently
identified for each timepoint, and combined
differentially expressed gene lists, ordered by
the number of independent timepoints each 
individual gene was detected on, were gen- 
erated for downstream pathway analysis. 
Volcano plots were generated for individual 
timepoints using the log2-fold change (log2FC)
and $P_{adj}$ values calculated by DESeq2. For
time-course analysis, log2FC for all genes in
the combined differentially expressed gene
lists were plotted over time, with point size
indicating $P_{adj}$ values. For downstream pathway
analyses, the log2FC for each gene (and
each comparison) was defined as the maximum
log2FC detected for that gene over the
time-course. Hierarchical clustering was performed on
vst-transformed counts of DEGs after Z-score 
standardization for each gene, and heatmaps 
were generated using the pheatmap package in R. 
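
The two-step replicate filter described at the start of this section can be illustrated with a short Python sketch. This is not the authors' R implementation. It assumes the standard modified Z-score (scaled by the median absolute deviation) and an absolute relative deviation from the mean; the function name, argument names, and example counts are hypothetical.

```python
from statistics import mean, median


def keep_replicates(cumulative_counts, z_cutoff=-3.0, d_cutoff=0.25):
    """Return one boolean per replicate (True = keep the replicate).

    Assumes positive cumulative counts. A replicate is kept if its modified
    Z-score is above z_cutoff, or otherwise if its relative deviation from
    the replicate mean is below d_cutoff.
    """
    y_med = median(cumulative_counts)
    y_mean = mean(cumulative_counts)
    # Median absolute deviation (MAD) around the median.
    mad = median(abs(y - y_med) for y in cumulative_counts)
    keep = []
    for y in cumulative_counts:
        # Modified Z-score; if the MAD is 0, no replicate is flagged by this rule.
        z = 0.6745 * (y - y_med) / mad if mad > 0 else 0.0
        # Relative deviation from the mean of replicates.
        d = abs(y - y_mean) / y_mean
        keep.append(z > z_cutoff or d < d_cutoff)
    return keep


# Hypothetical cumulative counts for five replicates; the last one is very low.
print(keep_replicates([52000, 48000, 50500, 49000, 3000]))
```

In this sketch the second rule rescues replicates that are flagged only because the other replicates are very tightly grouped.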

EcoCyc pathway enrichment analysis was 
performed using Fisher's exact test for lists
of DEGs generated for each experiment, and
pathway enrichment plots were created using
the top hits. Further, network analysis was
performed using the StringApp package (51)
in Cytoscape with a confidence score ≥0.4 for
these DEGs. MCL clustering using the cluster- 
Maker2 package (52) was used to generate gene 
clusters within gene networks detected using 
StringApp analysis in Cytoscape, and the func- 
tional enrichment function was used to anno- 
tate these clusters using KEGG pathways, 
UniProt keywords, NetworkNeighborAL, GO 
Process, GO Function, GO Component. Nodes 
contributing to functional enrichment of a 
cluster were marked by bold font. The size of 
a node corresponding to each gene in the 
STRING networks was adjusted to reflect 
the detected log2FC value for that gene.
Over-representation analysis (OA) was performed
for differentially expressed gene lists from each 
experiment based on both the GO resource 
(53) and Kyoto Encyclopedia of Genes and 
Genomes (KEGG) (54) using the clusterProfiler 
package (55) in R. Wherever experiments were 
replicated, regression analysis was performed 
to examine the similarity of regulation in the 
latter experiment for DEGs detected in the ini- 
tial experiment. The specific analysis param- 
eters used for each experiment and any changes 
or additions to the workflow are reported in 
the following sections. 
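
As an illustration of the pathway enrichment step, the following Python sketch computes a one-sided Fisher's exact test P-value for over-representation of a pathway among DEGs. Whether the original analysis used a one-sided test, and which background gene set it used, are assumptions here; the function and argument names are hypothetical.

```python
from math import comb


def fisher_enrichment_p(n_deg_in_pathway, n_deg, n_pathway, n_background):
    """One-sided Fisher's exact test P-value, P(X >= observed overlap).

    n_deg_in_pathway: DEGs that belong to the pathway (observed overlap)
    n_deg:            total number of DEGs
    n_pathway:        genes in the pathway
    n_background:     all genes considered
    """
    upper = min(n_deg, n_pathway)
    total = comb(n_background, n_deg)
    # Hypergeometric upper tail: sum over all overlaps at least as large
    # as the observed one.
    tail = sum(
        comb(n_pathway, i) * comb(n_background - n_pathway, n_deg - i)
        for i in range(n_deg_in_pathway, upper + 1)
    )
    return tail / total


# Hypothetical numbers: 5 of 100 DEGs fall in a 20-gene pathway, 4000 genes total.
p_value = fisher_enrichment_p(5, 100, 20, 4000)
print(p_value)
```

P-values obtained this way for many pathways would still need multiple-testing correction before ranking the top hits.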


Titration of aTc concentration in drinking water 


Germ-free C57BL/6(J) mice were maintained 
on the chow diet as described above and
received water containing 1, 10, or 30 µg/ml of
aTc as well as 100 µg/ml of kanamycin sulfate
1 day prior to gavage of 1 x 10° cells of E. coli
BL21(DE3) transformed with pFS_0453. Plasmid
DNA was extracted and concentrated as
described above and 6.25 µl of plasmid DNA
was used as an input into SENECA using
annealed adapter ligation oligonucleotides
FS_0963 and FS_0964 (table S25).


Record-seq comparison of transient chow, fat,
and starch-based dietary stimulus 

Germ-free C57BL/6(J) mice received the stan- 
dard chow diet (Kliba Nafag 3307) prior to 
the experiment. At the beginning of the ex- 
periment (2 days before gavage), one group 
remained on the chow diet, whereas the other 
two groups received either a starch-based puri- 
fied diet (Research Diets D12450Jii) or a fat- 
based (lard-based) purified diet (Research Diets 
D12492ii), both of which were sterilized by 
two rounds of irradiation of 10-20 kGy each.
Mice received drinking water containing
30 µg/ml of aTc and 100 µg/ml of kanamycin
sulfate 1 day before gavage with 1 x 10° cells 
of E. coli MG1655 transformed with pFS_0453 
as described above. After 7 days on different 
diets, mice from all groups received the chow 
diet (effectively switching fat- and starch-fed 
mice to the chow diet). Plasmid DNA from 
fecal pellets was extracted and concentrated 
as outlined above. Additionally, intestinal 
contents were sampled yielding the data pre- 
sented in fig. S1. The initial, 20-day diet switch 
experiment described in Fig. 1 and fig. S2
used 3.75 µl of plasmid DNA as an input into
SENECA using annealed adapter ligation
oligonucleotides FS_2759 and FS_2769 (table S25).
The consecutive, 14-day diet switch experiment
described in fig. S3 used 7.5 µl of plasmid
DNA as an input into SENECA using
annealed adapter ligation oligonucleotides 
FS_2240 and FS_2248 (table S25). 

Primary analysis was performed for Record- 
seq and RNA-seq readout as described above 
for both experiments. During secondary anal- 
ysis, the minimum value of cumulative transcript- 
aligning spacer counts (C) was set to 10,000 for 
Record-seq samples, and the minimum value 
of transcript-aligning counts for RNA-seq sam- 
ples was set to 100,000. PCA projections and 
UMAP embeddings were generated for the top 
500 most variable genes across days and diets. 
For the initial 20-day experiment, hierarchical 
clustering and heatmap generation were per- 
formed for DEGs detected by multiple testing 
on day 7 since this was the last day when the 
mice were fed different diets prior to switch- 
ing all mice to the chow diet. For both exper- 
iments, diet-specific signature genes were 
defined as the top 500 DEGs detected for day 7. 
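
Selecting the most variable genes before PCA or UMAP can be sketched as follows in Python. The sketch assumes that variability is measured as the variance of transformed counts across samples, which the text does not specify; the function name and input layout are hypothetical.

```python
from statistics import pvariance


def top_variable_genes(transformed_counts, n_top=500):
    """Return the n_top genes with the highest variance across samples.

    transformed_counts: dict mapping gene name -> list of transformed
    counts, one value per sample.
    """
    ranked = sorted(
        transformed_counts,
        key=lambda gene: pvariance(transformed_counts[gene]),
        reverse=True,
    )
    return ranked[:n_top]


# Hypothetical three-gene example with three samples each.
example = {"geneA": [1.0, 1.0, 1.0], "geneB": [0.0, 5.0, 10.0], "geneC": [2.0, 3.0, 4.0]}
print(top_variable_genes(example, n_top=2))
```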

Hierarchical clustering and heatmap gener- 
ation were performed for the final day in each 
experiment using these diet-specific signature 
genes. For the initial experiment, genes en- 
riched or depleted in each diet pair compari- 
son were identified on day 7 using pairwise DE 
testing and used for generating volcano plots. 
EcoCyc pathway (table S5) enrichment was 
performed using DEGs identified for each diet 
pair (tables S2 and S3) and the top hits were 
used for creating pathway enrichment plots. 
STRING network analysis was performed in
Cytoscape using DEGs with log2FC ≥ 1.0. The
granularity parameter for MCL clustering of
STRING networks was set at 2.5 and
stringdb::score was used as array source with
an edge cutoff of 0.5. Clusters containing more than
four nodes were reported. KEGG- and GO- 
based OA were performed for DEGs identi- 
fied for each diet pair using clusterProfiler 
(table S6). Regression analysis was performed 
to examine the similarity of regulation in the 
latter experiment for DEGs in mice fed the 
chow diet or purified diet based on starch de- 
tected in the initial experiment. 


E. coli ΔidnK/ΔgntK in vivo competition assay


One group of germ-free C57BL/6(J) mice was 
switched to the starch-based diet 48 hours be- 
fore gavage whereas the other remained on 
the standard chow diet. E. coli MG1655 (WT, 
NCBI Assembly database PRJNA807125) was
grown in 200 ml of LB and E. coli MG1655
Str^R ΔidnK/ΔgntK in 250 ml of LB with 30 µg/ml
of chloramphenicol (Sigma-Aldrich) at 37°C,
180 rpm overnight. Cultures were pelleted by 
centrifugation at 3480g for 10 min at room 
temperature, washed twice with 250 ml of ster- 
ile PBS and combined after the first washing 
step at a 1:1 ratio. The combined E. coli strains 
were finally resuspended in 7.5 ml of sterile 
PBS and diluted 1:10 in sterile PBS to gavage 
~1 x 10° CFU per mouse in 500 µl. Feces were
collected at 24 hours, 48 hours, and 72 hours 
after gavage and lysed in 1 ml of PBS using a 
Retsch MM400 tissue lyser at 30 Hz for 3 min. 
Serial dilutions were streaked onto LB agar and 
LB agar containing 30 µg/ml of chloramphenicol,
and the competitive indices were calculated
as $\mathrm{CFU}_{\Delta idnK/\Delta gntK}/\mathrm{CFU}_{WT} = \mathrm{CFU}_{\Delta idnK/\Delta gntK}/(\mathrm{CFU}_{total} - \mathrm{CFU}_{\Delta idnK/\Delta gntK})$.
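
The competitive index defined above can be computed as in the following Python sketch. It assumes that plain LB agar yields the total CFU and that chloramphenicol-containing agar yields the mutant CFU; the function name and example values are hypothetical.

```python
def competitive_index(cfu_total, cfu_mutant):
    """Competitive index = mutant CFU / wild-type CFU.

    Wild-type CFU is not counted directly; it is taken as the difference
    between total CFU (non-selective plates) and mutant CFU (selective plates).
    """
    cfu_wt = cfu_total - cfu_mutant
    if cfu_wt <= 0:
        raise ValueError("wild-type CFU must be positive to compute the index")
    return cfu_mutant / cfu_wt


# Hypothetical counts per gram of feces.
print(competitive_index(cfu_total=1.0e9, cfu_mutant=2.5e8))
```

An index below 1 indicates that the mutant is outcompeted by the wild type in that sample.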


E. coli ΔuxaC in vivo competition assay


The competition experiment using the ΔuxaC
mutant was performed as above with the following
alterations: E. coli MG1655 Str^R ΔuxaC
(NCBI Assembly database PRJNA807125) was
gavaged into germ-free mice together with
either MG1655 WT (NCBI Assembly database
PRJNA807125) or with MG1655 Str^R (NCBI
Assembly database PRJNA807125). Each strain
was grown in 100 ml of LB and the ΔuxaC
mutant with an additional 50 µg/ml of kanamycin
sulfate. Strains were washed with 100 ml
of PBS, mixed, and then resuspended in a final
volume of 33 ml to achieve a gavage dose of
~1 x 10° CFU per mouse in 500 µl. Serial
dilutions of feces were streaked onto LB agar
with and without 50 µg/ml of kanamycin
sulfate.


13 May 2022


Record-seq and RNA-seq assessment of
different anatomical sections of the murine gut


One group of germ-free C57BL/6(J) mice was
switched to the starch-based diet 48 hours
before gavage whereas the other remained on
the standard chow diet. Mice received drinking
water containing 30 µg/ml of aTc and
100 µg/ml of kanamycin sulfate and
were gavaged with 1 x 10° cells of E. coli
MG1655 transformed with pFS_0453 as de- 
scribed above. Record-seq plasmid and/or fe- 
cal RNA was extracted from individual mouse 
feces collected daily. On day 7 after gavage, 
mice were sacrificed and E. coli RNA was
collected from various intestinal sections of 
individual mice. Cecal contents were mixed 
in a Petri dish to homogenize them and then 
~100 mg was collected for RNA extraction. 
Proximal colon contents were collected at a 
1-2-cm distance from the cecum. Distal colon 
contents were collected in the terminal 2 cm 
of the colon. RNA was extracted, sequenced, 
and analyzed as described above. The experiment
was performed twice: once with samples
pooled from three individual mice each
and once with sampling from individual mice.
Primary analysis was performed for RNA-seq 
readout as described above for both experi- 
ments. During secondary analysis, the mini- 
mum value of transcript-aligning counts was 
set to 100,000. Differential expression analysis 
between the diet groups was performed for 
each intestinal section (cecum, proximal colon 
and distal colon) independently. Record-seq 
DEGs were overlapped with DEGs identified
through RNA-seq proximally (cecum or proximal
colon) or distally (distal colon)
($P_{adj,section} < 0.1$; $\log_2\mathrm{FC}_{Record\text{-}seq} \times \log_2\mathrm{FC}_{section} \geq 0$).
Differential expression analysis between the
intestinal sections was performed for each diet 
group independently. Genes enriched in a par- 
ticular intestinal section were defined as an 
overlap of genes identified as up-regulated in 
that section compared to the other two sec- 
tions (e.g., genes enriched in the cecum on the 
chow diet were defined as genes that were up- 
regulated in the cecum on the chow diet com- 
pared to both the proximal and distal colon). 
Rank-based normalization was used for com- 
paring Record-seq and RNA-seq counts to ac- 
count for the differences in count distributions. 
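
The overlap between Record-seq DEGs and section-specific RNA-seq DEGs described above can be sketched in Python as follows. The sketch treats the sign condition as a non-negative product of the two log2FC values, which is an assumption, and the function name and input layout are hypothetical.

```python
def concordant_overlap(record_seq_degs, rna_seq_results, padj_cutoff=0.1):
    """Return Record-seq DEGs that are also significant in one gut section
    by RNA-seq and regulated in the same direction.

    record_seq_degs: dict gene -> Record-seq log2FC (genes already called DEGs)
    rna_seq_results: dict gene -> (adjusted P-value, log2FC) for one section
    """
    shared = []
    for gene, lfc_record in record_seq_degs.items():
        if gene not in rna_seq_results:
            continue
        padj_section, lfc_section = rna_seq_results[gene]
        # Significant in the section and same sign of regulation.
        if padj_section < padj_cutoff and lfc_record * lfc_section >= 0:
            shared.append(gene)
    return shared


# Hypothetical values for three genes.
record = {"geneA": 1.8, "geneB": -2.1, "geneC": 0.9}
cecum = {"geneA": (0.01, 1.2), "geneB": (0.03, 0.7), "geneC": (0.40, 1.1)}
print(concordant_overlap(record, cecum))
```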


In vitro exposure of E. coli to DSS 


For each replicate, three colonies of E. coli 
MG1655 transformed with pFS_0453 were 
inoculated into 2 ml of terrific broth (TB) 
(24 g per liter of yeast extract, 20 g per liter of 
tryptone, 4 ml per liter of glycerol, 17 mM 
KH2PO4, and 72 mM K2HPO4) containing
50 ng/ml of aTc and 0.1, 0.3, 1, 3, or 10% (w/v) 
of DSS. After overnight culture at 37°C and
220 rpm, bacterial cultures were pelleted and
plasmid DNA was extracted using 500 µl of buffer
P1, 500 µl of buffer P2, and 700 µl of buffer
N3. Washes were performed as described 
above. A single elution was performed by adding
60 µl of buffer TE, followed by incubation
at 55°C, 850 rpm for 1 min and centrifugation
at 20,000g for 1 min at room temperature.
Plasmid DNA was quantified as described (18),
and input normalized for SENECA using
annealed adapter ligation oligonucleotides
FS_2108 and FS_2109 (table S25). Primary and
secondary analysis of data was performed as 
described above, with the minimum value of 
cumulative transcript-aligning spacer counts 
per sample (C) set to 10,000. 


Record-seq assessment of DSS colitis in vivo 


Germ-free C57BL/6(J) mice received drinking 
water containing 30 µg/ml of aTc and 100 µg/ml
of kanamycin sulfate and were
gavaged with 1 x 10° cells of E. coli MG1655 or
E. coli BL21(DE3) transformed with pFS_0453 
as described above. DSS (molecular weight 
36 to 50 kDa, Lucerna-Chem) was dissolved 
in tap water to 1, 2, or 3% (w/v) with aTc and 
kanamycin sulfate as above. Mice received 
DSS-containing drinking water for 5 days as 
illustrated in the experimental outlines. Mice 
were subsequently switched back to aTc and 
kanamycin sulfate-containing water without 
DSS. Mice were monitored daily for signs of 
distress. Mice treated with 3% DSS (w/v) in 
the drinking water had to be removed from 
the study on day 13 of the experiment due to 
colitis-induced distress and are thus omitted 
from the analysis for days 14 to 19 in Fig. 4, B 
and C. Plasmid DNA from fecal pellets was ex- 
tracted and concentrated as outlined above. 

The 2% DSS (w/v) experiment (Fig. 4D) with 
E. coli MG1655 used 3.75 µl of plasmid DNA as
an input to SENECA using annealed adapter
ligation oligonucleotides FS_2806 and FS_2807
(table S25). The initial experiment, including
different concentrations of DSS (1%, 2%,
3% w/v) with E. coli BL21(DE3), used 7.5 µl
of plasmid DNA as an input to SENECA using
the annealed adapter ligation oligonucleotides
FS_2238 and FS_2246 (table S25).

Primary analysis of the Record-seq sequenc- 
ing readout for both experiments was per- 
formed as described above. For the initial 
experiment (Fig. 4A) using E. coli BL21(DE3),
the minimum value of C was set to 5,000 
counts. Hierarchical clustering and heatmap 
generation using DEGs was performed for day 
19. For inferring the trajectory of DSS-induced 
differences between treatment groups using 
PCA-projected data from days 5 to 9 (fig. S6D), 
an approach analogous to Slingshot (56) was 
employed: clusters were defined using k-medoids 
clustering with an optimal k value (k = 5)
determined by the elbow method based on
minimizing within-cluster sum of squares. 
Branching of trajectories was inferred based 
on the assumption that clusters with samples 
from earlier timepoints branch out into clus- 
ters with samples from later timepoints, with 
the nodes of the plotted trajectory indicating 
the center of each cluster. For discriminating 
between treatment groups, Record-seq data 
on days 6 to 19 (treatment and post-treatment) 
was randomly split into a 70% training and a 
30% test set using a fixed random seed to
ensure reproducibility. One-versus-rest clas- 
sification was performed on the test sets with 
SVM models trained and tuned on the train- 
ing sets using leave-one-out cross-validation 
implemented in the e1071 package in R. Re- 
ceiver operating characteristic (ROC) curves 
were generated using the ROCR package in R. 
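
A reproducible 70%/30% split of samples, as used before classifier training, can be sketched in Python as follows. The original analysis was performed in R; the seed value, function name, and sample identifiers below are hypothetical.

```python
import random


def split_train_test(sample_ids, train_fraction=0.7, seed=42):
    """Randomly split sample identifiers into training and test sets.

    Sorting first and using a dedicated, seeded generator makes the split
    independent of input order and of any other random number use.
    """
    ids = sorted(sample_ids)
    rng = random.Random(seed)
    rng.shuffle(ids)
    n_train = round(len(ids) * train_fraction)
    return ids[:n_train], ids[n_train:]


samples = [f"sample_{i:02d}" for i in range(10)]
train, test = split_train_test(samples)
print(len(train), len(test))
```

Tuning of model hyperparameters (here, by leave-one-out cross-validation) would then use only the training set, with the test set held back for the final evaluation.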

For secondary analysis, in the 2% DSS ex- 
periment (E. coli MG1655), the threshold for 
C was set to 10,000 counts for Record-seq. 
PCA dimensionality reduction and k-medoids 
clustering were performed for Record-seq sam- 
ples collected post DSS treatment. k-medoids 
cluster identity was encoded by convex hulls 
in PCA projections. UMAP and time-course
log2FC plots were generated and differential
expression analysis was performed using days 2 to
20, and identified DEGs were used for EcoCyc 
pathway enrichment (table S9) and STRING 
network analysis using Cytoscape. MCL clus- 
tering was performed with the granularity 
parameter set at 2.5 and stringdb::score was 
used as array source with an edge cutoff of 
0.5 to identify groups in the STRING networks. 
Clusters containing more than two genes were 
reported. KEGG- and GO-based OA were per- 
formed for DEGs using clusterProfiler (table S10). 


Record-seq in the presence or absence 
of B. theta 


Germ-free C57BL/6(J) mice received the standard
chow diet throughout the entire experiment
and drinking water containing 30 µg/ml of
aTc and 100 µg/ml of kanamycin sulfate from
the day before gavage with 1 x 10° cells of 
E. coli MG1655 transformed with pFS_0453 as 
described above. For oral gavage of B. theta, 
a single colony of B. theta was grown over- 
night in 140 ml of brain heart infusion broth 
(Thermo Fisher Scientific) supplemented with 
0.5 mg per liter of menadione and 5 mg per 
liter of hemin (both Sigma-Aldrich) at 37°C 
under anaerobic conditions without agitation. 
After two wash steps with 150 ml of sterile 
anaerobic PBS, B. theta was resuspended 
and mixed with E. coli suspension to gavage
1 x 10° CFU E. coli and 1 x 10° CFU B. theta each
in a total volume of 500 µl. Plasmid DNA was
extracted from fecal samples using the procedure
outlined above but increasing the buffer
volumes to 1000 µl of buffer P1, 1000 µl of
buffer P2, and 1400 µl of buffer N3. 2-propanol
precipitation and resuspension were performed
as described above. The 9-day experiment
used 3.75 µl of plasmid DNA as an input into
SENECA using annealed adapter ligation
oligonucleotides FS_2762 and FS_2772
(table S25). The 27-day experiment used 3.75 µl
of plasmid DNA as an input and annealed
adapter ligation oligonucleotides FS_2758 and
FS_2768 (table S25).

For measurement of bacterial colonization 
levels, feces were collected, weighed, and lysed 
in 1 ml of PBS using a Retsch MM400 tissue
lyser at 30 Hz for 3 min. Serial dilutions were
streaked onto LB agar plates to determine
E. coli CFU and on BHI agar with 5% defibrinated
sheep blood to determine B. theta
CFU and incubated aerobically or anaerobically,
respectively.

Primary analysis of data was performed for 
Record-seq data as described above for both 
experiments. For secondary analysis, the mini- 
mum value of C was set to 5000. This lower 
threshold for cumulative transcript-aligning 
spacer counts was chosen due to fewer ob- 
served counts, possibly explained by the diffi- 
culty in plasmid DNA extraction from E. coli 
colonized with B. theta. Hierarchical cluster- 
ing was performed and heatmaps were gen- 
erated for the full time-course using DEGs 
detected on at least 2 days. Combined lists of 
DEGs identified on any day (tables S11 and S12)
were used for subsequent EcoCyc pathway enrichment
(table S13), STRING network analysis
using Cytoscape, and KEGG- and GO-based
OA using clusterProfiler (table S14). The granu- 
larity parameter for MCL clustering of STRING 
networks was set at 2.5 and stringdb::score was 
used as array source with an edge cutoff of 0.5. 
Clusters containing more than three genes were 
reported. 


Record-seq function in complex microbiomes 
(sDMDMm2 mice) 


C57BL/6(J) mice with the sDMDMm2 micro- 
biota (table S23) received the standard chow 
diet or the starch-based purified diet (Research 
Diets D12450Jii) and were switched to drinking
water containing 30 µg/ml of aTc but no
kanamycin sulfate 6 days before gavage. The
gavage procedure was modified as follows:
400 ml of an overnight culture of E. coli
MG1655 cells transformed with pFS_0453 in
LB with 50 µg/ml of kanamycin sulfate were
diluted 1:5 into 2 liters of pre-warmed LB with
30 ng/µl of aTc and 50 µg/ml of kanamycin
sulfate and cultured for another 2 hours. Bacte- 
ria were pelleted by centrifugation at 3480g 
for 10 min at room temperature, washed twice 
with 1 liter of PBS and resuspended in 12 ml to 
gavage 6 x 10°° CFU into each mouse in 500 µl
of PBS. Plasmid DNA was extracted from fecal
samples using the procedure outlined above
but increasing the buffer volumes to 1000 µl of
buffer P1, 1000 µl of buffer P2, and 1400 µl of
buffer N3. Isopropanol precipitation and
resuspension were performed as described above.
SENECA adapter ligation was performed using
3.75 µl of plasmid DNA and annealed adapter
ligation oligonucleotides FS_2760 and FS_2770
as well as FS_2761 and FS_2771 (table S25) with
the following modification compared to the
standard protocol: annealed adapter ligation
oligonucleotides were diluted 1:10 instead of
1:100 in TE buffer after annealing. E. coli
colonization levels were measured by homogenization
and serial dilution of fecal pellets as described for
the B. theta experiment above, but serial dilutions
were spread on MacConkey agar (Thermo
Fisher Scientific) with 50 µg/ml of kanamycin
sulfate to ensure the selective growth of E. coli
sentinel cells.

Primary analysis of data was performed for 
Record-seq data as described above for both 
experiments. For secondary analysis, the mini- 
mum value of C was set to 5000. Since the 
genome-aligning spacer counts increased over 
the sampling time course, data from the final 
21-hour timepoint was used for identifying 
DEGs, hierarchical clustering and heatmap gen- 
eration, EcoCyc pathway enrichment (table S17), 
STRING network analysis using Cytoscape, and 
KEGG- and GO-based OA using clusterProfiler 
(table S18). The granularity parameter for MCL 
clustering of STRING networks was set at 2.5 
and stringdb::score was used as array source 
with an edge cutoff of 0.5. Clusters containing 
at least two genes were reported. 


Transcription-stimulated spacer acquisition 


Transcription-stimulated recording constructs
were generated by designing gBlock FS_3046
following the supplementary methods
of (38). gBlock FS_3046 (table S25) encodes
“T7UUCG(terminator)_rrnBT1(terminator)_
T7Term(terminator)_Golden-Gate(BbsI)_
FsLeader-CRISPR-02-array-DR2_50bp(stuffer)_
EK120029600(terminator_inverted)” and is
inserted downstream of FsRT-Cas1-Cas2 in
pFS_0453 by cloning with XhoI and NotI. This
yields a plasmid where oligonucleotides en- 
coding various promoter sequences can be 
inserted using a Golden Gate reaction with 
BbsI. Based on pFS_1061, plasmids pFS_1062
to pFS_1067 as well as pFS_1112 to pFS_1114
were generated. These plasmids encode con- 
stitutive E. coli promoters of different tran- 
scriptional activity upstream of the FsLeader_ 
CRISPR_02_array. These promoters are in- 
serted into pFS_1061 by generating double 
stranded (dsDNA) fragments encoding the 
respective promoter and appropriate over- 
hangs through the annealing of oligonucleo- 
tides FS_3047 and FS_3048 (for pFS_1062), 
FS_3049 and FS_3050 (for pFS_1063), FS_3051 
and FS_3052 (for pFS_1064), FS_3053 and 
FS_3054 (for pFS_1065), FS_3055 and FS_3056 
(for pFS_1066), FS_3057 and FS_3058 (for 
pFS_1067), FS_3210 and FS_3211 (for pFS_1112), 
FS_3212 and FS_3213 (for pFS_1113), FS_3214 
and FS_3215 (for pFS_1114). Oligonucleotide 
sequences are listed in table S25. Oligonucleotides
were annealed by mixing 2.5 µl of 100 µM
oligonucleotides in buffer TE with 5 µl of
NEBuffer 2.0 and 40 µl of Ultrapure H2O
(ThermoFisher Scientific) per reaction.
The reaction was heated to 95°C for 5 min
in a thermocycler and cooled to 22°C at a rate
of 0.5°C/sec. Then the annealed oligonucleotides
were diluted 1:200 in Ultrapure H2O. For
each target plasmid a Golden Gate reaction
was performed containing 40 fmol of pFS_1061,
1 µl of 1:200 diluted, annealed oligonucleotides,
1 µl of a mixture of ATP and DTT (GO mM
each) (ThermoFisher Scientific), 0.25 µl of T7
DNA Ligase (NEB), 0.75 µl of 40% (w/v)
PEG8000 (Sigma-Aldrich), 0.75 µl of BpiI
(ThermoFisher Scientific), and 1 µl of buffer
green (ThermoFisher Scientific). Each reaction
was brought up to a 10-µl final volume
using Ultrapure H2O (ThermoFisher Scientific).

From each Golden Gate reaction, 0.5 µl was
transformed into 5 µl of chemically competent
E. coli Stbl3. Individual clones were grown in
LB media containing 50 µg/ml of kanamycin
sulfate, isolated by plasmid mini-prep and
validated by Sanger sequencing (GATC Eurofins).
Correct clones were then transformed
into chemically competent E. coli MG1655 to
assess efficiency of recording. In vitro recording
and SENECA reactions were performed as
described previously (17, 18). Upon identification
of J23103 [iGEM Registry of Standard
Biological Parts (http://parts.igem.org)], gBlock
FS_3344 encoding “T7UUCG(terminator)_rrnBT1
(terminator)_T7Term(terminator)_J23103_
FsLeader-CRISPR-01-array-DR1_50bp(stuffer)_
EK120029600(terminator_inverted)” was designed
and cloned into pFS_0453 using XhoI
and NotI, yielding pFS_1142.


Record-seq for multiplexed recording in different 
bacterial chassis 


For in vivo multiplexed recording experiments,
pFS_1113 and pFS_1142 were transformed into
chemically competent MG1655 Str^R ΔuxaC
(“ΔuxaC”, NCBI Assembly database PRJNA807125)
or MG1655 Str^R Nal^R (“wt”, NCBI Assembly
database PRJNA807125) and spread on LB-Agar
plates containing 50 µg/ml of kanamycin sulfate.
Germ-free C57BL/6(J) mice were switched
to the starch-based diet 72 hours before gavage
and received drinking water containing 30 µg/ml
of aTc and 100 µg/ml of kanamycin sulfate
from 24 hours before gavage onwards. All
E. coli strains were inoculated into LB medium
with 100 µg/ml of streptomycin sulfate
and 50 µg/ml of kanamycin sulfate, with or
without 50 µg/ml of nalidixic acid, respectively,
and grown overnight at 37°C with 180 rpm
shaking. Following washing as described above,
the following strains were mixed 1:1 and a
total of 1 x 10° CFU (5 x 10° CFU per strain and
mouse) were orally gavaged into recipient mice
(Figure 7B): wt-pFS_1113 together with
ΔuxaC-pFS_1142, and wt-pFS_1142 together with
ΔuxaC-pFS_1113. Colonization levels of the
strains were determined by fecal dilution and
plating on LB/Str100/Kan50 (both ΔuxaC and
wt) and LB/Str100/Kan50/Nal50 (only wt).
Plasmid DNA was extracted from fecal samples
using the procedure outlined above but
increasing the buffer volumes to 500 µl of buffer
P1, 500 µl of buffer P2, and 700 µl of buffer
N3. 2-propanol precipitation and resuspension
were performed as described above.

SENECA adapter ligation was performed
using 7.5 µl of plasmid DNA and annealed
adapter ligation oligonucleotides
FS_3194 and FS_3204 as well as FS_3316
and FS_3321 (table S25) with the following
modification compared to the standard protocol:
annealed adapter ligation oligonucleotides were
diluted 1:10 instead of 1:100 in TE buffer after
annealing. Upon annealing to dsDNA fragments,
oligonucleotides FS_3194 and FS_3204
form an overhang compatible with FaqI-digested
FsDR2 and oligonucleotides FS_3316 and
FS_3321 form an overhang compatible with
FaqI-digested FsDR1. Therefore, both sets of
annealed oligos were used in a single SENECA
adapter ligation reaction to simultaneously
read out CRISPR spacer acquisition into pFS_1113
and pFS_1142. Accordingly, SENECA first-round
PCR was performed with FS_968, FS_969,
FS_970, FS_971, FS_972, FS_973, and FS_974, described
previously, which are primers compatible
with FsDR2, along with additional
primers compatible with FsDR1, namely
FS_3325, FS_3326, FS_3327, FS_3328, FS_3329,
FS_3330, and FS_3331 (table S25); each primer was
used at a concentration of 0.714 µM. The universal
reverse primer FS_911 (8) was used at a
concentration of 10 µM. SENECA second-round
PCR was performed as described above.

For primary analysis of data, a modified 
version of the Snakemake workflow described 
above was used. This workflow requires the 
additional input of a table with sample-specific 
entries for the reference plasmid, DR sequence 
and library barcode. Samples with multiple 
DR-barcoded E. coli strains, such as those
from the in vivo multiplexed recording exper- 
iment, resulted in reads containing both DR 
sequences (FsDR2 and FsDR1). These were 
stratified into strain-specific reads by identify- 
ing the DR-specific library barcode using fuzzy 
string matching and processed independently. 
Secondary analysis was performed as described 
above. 
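
Stratifying reads by a DR-specific barcode with fuzzy string matching can be sketched in Python as follows. This is not the workflow's actual code: it uses a simple sliding-window Hamming distance with one allowed mismatch as a stand-in for whichever fuzzy matching method was used, and the barcode sequences, function names, and mismatch tolerance are hypothetical.

```python
def hamming(a, b):
    """Number of mismatching positions between two equal-length strings."""
    return sum(x != y for x, y in zip(a, b))


def assign_strain(read, barcodes, max_mismatches=1):
    """Return the strain label whose barcode best matches any window of the
    read, or None if no barcode matches within max_mismatches.

    barcodes: dict mapping strain label -> barcode sequence.
    Ties are resolved in favor of the first barcode in the dict.
    """
    best = None  # (label, mismatches) of the best hit so far
    for label, barcode in barcodes.items():
        for start in range(len(read) - len(barcode) + 1):
            d = hamming(read[start:start + len(barcode)], barcode)
            if d <= max_mismatches and (best is None or d < best[1]):
                best = (label, d)
    return best[0] if best else None


# Made-up barcodes for the two direct-repeat variants.
barcodes = {"FsDR1": "ACGTAC", "FsDR2": "TTGCAA"}
print(assign_strain("GGACGTTCGG", barcodes))
```

Reads assigned to each label would then be processed independently, as described for the modified workflow.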

For the in vitro multiplexed recording ex- 
periment, the minimum value of cumulative 
transcript-aligning spacer counts (C) was set to 
30,000. For the in vivo multiplexed recording 
experiment, the minimum value of cumulative 
transcript-aligning spacer counts (C) was set to 
10,000. DEGs were identified between the 
strains (wt vs ΔuxaC) for days 7 to 10 in both
groups of mice independently (group 1: ΔuxaC-DR1,
wt-DR2; group 2: wt-DR1, ΔuxaC-DR2;
n = 5 in each group). High-confidence DEGs 
were defined as DEGs identified as regulated 
in the same direction for at least 3 out of 
4 days in both comparisons. These high- 
confidence DEGs (table S19) were used for 
EcoCyc pathway enrichment (table S20) and 
KEGG- and GO-based OA using clusterProfiler 
(table S21). 
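
The definition of high-confidence DEGs can be illustrated with the following Python sketch. It assumes that per-day DEG calls are available as log2FC values for each group of mice; the function name, input layout, and example values are hypothetical.

```python
def high_confidence_degs(group1, group2, min_days=3):
    """Genes called as DEGs in the same direction on at least min_days days
    in both groups of mice, with the same direction in both groups.

    groupN: dict mapping day -> dict of gene -> log2FC for DEGs on that day.
    """
    def consistent_direction(group):
        up, down = {}, {}
        for day_degs in group.values():
            for gene, lfc in day_degs.items():
                if lfc > 0:
                    up[gene] = up.get(gene, 0) + 1
                elif lfc < 0:
                    down[gene] = down.get(gene, 0) + 1
        direction = {}
        for gene in set(up) | set(down):
            if up.get(gene, 0) >= min_days:
                direction[gene] = 1
            elif down.get(gene, 0) >= min_days:
                direction[gene] = -1
        return direction

    d1 = consistent_direction(group1)
    d2 = consistent_direction(group2)
    return sorted(gene for gene in d1 if d2.get(gene) == d1[gene])


# Hypothetical DEG calls for days 7 to 10 in the two groups.
g1 = {7: {"a": 1.2, "b": -1.0}, 8: {"a": 0.9}, 9: {"a": 1.1, "b": -0.8}, 10: {"b": -1.3}}
g2 = {7: {"a": 1.0, "b": 0.7}, 8: {"a": 1.4, "b": 0.9}, 9: {"a": 0.8, "b": 1.1}, 10: {}}
print(high_confidence_degs(g1, g2))
```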


NEUROIMMUNOLOGY


Reversible CD8 T cell-neuron cross-talk causes 
aging-dependent neuronal regenerative decline 


INTRODUCTION: Axonal regeneration and
neurological functional recovery are extremely
limited in the elderly. Consequently, injuries to the
nervous system are typically followed by severe 
and long-term disability. Our understanding 
of the molecular mechanisms underlying aging- 
dependent regenerative failure is poor, hinder- 
ing progress in the development of effective 
therapies for neurological repair. To facilitate 
the design of repair strategies, there is a press- 
ing need to identify critical molecular and 
cellular mechanisms that cause regenerative 
failure in aging. 


RATIONALE: Aging causes a broad spectrum of 
modifications in cell signaling, including changes 
in metabolism, immunity, and overall tissue
homeostasis, which play key roles in nervous 
system physiology and response to insults. 
Thus, we hypothesized that injuries to the 
aged nervous system would be followed by 
unique molecular and cellular modifications 
that would contribute to aging-dependent 
regenerative decline. To this end, molecular 
and cellular signatures associated with aging 
and injury to the nervous system were sys- 
tematically investigated by performing RNA 
sequencing from dorsal root ganglia (DRG) 
in a well-established model of sciatic nerve 
injury in young versus aged mice. Insight into 
these mechanisms could allow the discovery
of previously unrecognized molecular targets
to counteract aging-dependent regenerative
decline.

Sequence of events leading to aging-dependent regenerative failure in injured sensory neurons.
(1) Lymphotoxin β activates NF-κB; (2) phosphorylated NF-κB increases CXCL13 expression; (3) CXCR5+CD8+
T cells are recruited into the DRG parenchyma; (4 and 5) sciatic axonal injury activates injury signals;
(6) MHC I is presented by DRG neurons; (7) CXCR5+CD8+ T cells engage with MHC I and activate neuronal
caspase 3; (8) cleaved caspase 3 reduces pAKT and pS6, leading to regenerative failure.


RESULTS: Initial analysis of RNA sequencing 
data identified that aging was mainly asso- 
ciated with a marked increase in T cell activa- 
tion and signaling in DRG after sciatic nerve 
injury in mice. Subsequent experiments dem- 
onstrated that aging was associated with 
increased inflammatory cytokines including 
lymphotoxins in DRG both preceding and fol- 
lowing sciatic nerve injury. Specifically, we 
found that lymphotoxin β was required for the 
phosphorylation of NF-κB that drives the expression 
of the chemokine CXCL13 in DRG sensory neurons. 
CXCL13 attracted CD8⁺ T cells that expressed the 
CXCL13 receptor CXCR5 to the proximity of neurons 
that act as antigen-presenting cells by overexpressing 
major histocompatibility complex class I (MHC I) after 
a sciatic nerve injury. The engagement of 
CXCR5⁺CD8⁺ T cells with MHC I-expressing 
sensory neurons activated caspase 3, which 
impaired pAKT and pS6 signaling, leading to 
regenerative failure. Pharmacological antago- 
nism of caspase 3 activation reversed regen- 
erative failure in aged animals and restored 
pAKT and pS6 expression. Adoptive transfer 
experiments, including a comparison between 
Cxcr5−/− and wild-type cells, directly implicated 
CXCR5⁺CD8⁺ T cells in restricting axonal 
regeneration after sciatic injury in the aged 
animals. Finally, neutralization of CXCL13 
with monoclonal antibodies antagonized the 
recruitment of CXCR5⁺CD8⁺ T cells to the 
DRG and restored the regenerative ability of 
sciatic sensory axons in aged mice to levels 
comparable to those found in young animals. 
CXCL13 antagonism also significantly pro- 
moted skin reinnervation and neurological 
recovery of sensory function. 


CONCLUSION: We have identified an aging- 
dependent mechanism that relies on the 
CXCL13-dependent recruitment and activation 
of CXCR5⁺CD8⁺ T cells in proximity to 
sensory neurons. These T cells restrict axonal 
regeneration after nerve injury following communication 
with MHC I-presenting DRG neurons. 
CXCL13 antagonism with humanized mono- 
clonal antibodies can revert regenerative de- 
cline and promote neurological recovery in 
aged mice, suggesting a pathway that could 
be exploited in future therapies. 
aging-dependent neuronal regenerative decline 
Aging is associated with increased prevalence of axonal injuries characterized by poor regeneration 
and disability. However, the underlying mechanisms remain unclear. In our experiments, RNA sequencing 
of sciatic dorsal root ganglia (DRG) revealed significant aging-dependent enrichment in T cell 
signaling both before and after sciatic nerve injury (SNI) in mice. Lymphotoxin activated the 
transcription factor NF-κB, which induced expression of the chemokine CXCL13 by neurons. This in 
turn recruited CXCR5⁺CD8⁺ T cells to injured DRG neurons overexpressing major histocompatibility 
complex class I. CD8⁺ T cells repressed the axonal regeneration of DRG neurons via caspase 3 activation. 
CXCL13 neutralization prevented CXCR5⁺CD8⁺ T cell recruitment to the DRG and reversed aging-dependent 
regenerative decline, thereby promoting neurological recovery after SNI. Thus, axonal 
regeneration can be facilitated by antagonizing cross-talk between immune cells and neurons. 


With aging, there is a decline in axonal 
regenerative ability that undermines 
recovery and often leads to long-term 
disability (1-4). The cellular and molecular 
mechanisms underlying aging-dependent 
regenerative decline remain poorly 
understood. Previous studies have shown that 
an age-related impairment in dedifferentiation 
and activation of Schwann cells limits axonal 
regrowth in the injured peripheral nervous sys- 
tem, impairing sensory and motor recovery 
(5, 6). 

Aging itself produces a wide range of pro- 
found modifications in cell signaling, metab- 
olism, immunity, gene regulation, and protein 
translation, which affect tissue homeostasis 
(7-9). Therefore, we hypothesized that aging 
involves the development of unique molecular 
and cellular mechanisms preceding injury, 
which, in combination with injury-specific sig- 
nals, would prime neurons toward regener- 
ative failure. A better understanding of these 
mechanisms could allow the identification of 
additional molecular strategies to counteract 
aging-dependent regenerative decline. 
RNA sequencing (RNA-seq) of sciatic nerve 
dorsal root ganglia (DRG) in mice identified 
an aging-dependent enrichment in immune 
and cytokine/chemokine signaling that inten- 
sified after sciatic nerve injury. The aging- 
dependent increase in inflammatory cytokines, 
including lymphotoxin, activated NF-κB in 
DRG neurons, which up-regulated expression 
of the chemokine CXCL13. CXCL13 in turn recruited 
CXCR5⁺CD8⁺ T cells to sites near neurons 
acting as antigen-presenting cells (APCs) 
by overexpressing major histocompatibility 
complex class I (MHC I). Activated CD8⁺ T cells 
repressed axonal regeneration of sensory DRG 
neurons by inhibiting regenerative signals via 
caspase 3-dependent reduction in pAKT and 
pS6. In vivo CD8⁺ T cell depletion or CXCL13 
neutralization restored axonal regeneration of 
sensory neurons, promoting functional recovery. 

Thus, we identify an aging-dependent mech- 
anism restricting the regenerative ability of 
axons. Moreover, we propose antibody-mediated 
manipulation of neuron-immune cell cross- 
talk as a clinically suitable approach for neu- 
rological repair in the elderly. 


Aging induces extensive changes in adaptive 
immune gene profiles in sciatic DRG at 
homeostasis and after injury 


RNA-seq was performed on sciatic DRG from 
8- to 10-week-old (young) versus 20- to 22-month- 
old (aged) mice either preceding (sham) or 
following a sciatic nerve injury (SNI). Changes 
in gene expression were compared to the base- 
line expression in young sham animals. Sig- 
nificantly differentially expressed genes [false 
discovery rate (FDR) < 0.05] were analyzed by 
Gene Ontology (GO) and Kyoto Encyclopedia 
of Genes and Genomes (KEGG) for functional 
classification (Fig. 1A, fig. S1A, and data S1 and 
S2). The highest number of significantly differentially 
expressed genes (FDR < 0.05) was 
identified in aged DRG after injury (fig. S1, B 
to D, and data S1). The down-regulation of 
GO classes implicated in neurodevelopment, 
neurotransmitters, ion transport, G protein- 
coupled signaling, and signal transduction 
observed in young animals failed to occur in 
aged DRG even after SNI (Fig. 1A and data S2). 
Aged DRG displayed a marked enrichment in 
gene expression associated with adaptive im- 
mune responses, especially T cell and cytokine/ 
chemokine signaling, which was amplified 
after SNI (Fig. 1A, fig. S1A, and data S2). The 
prominent aging-dependent regulation of the 
immune response was then prioritized for 
further analysis. A shortlist of 310 transcripts 
(data S3) enriched for immune response ac- 
cording to GO and KEGG (P < 0.01) was used 
to map hypothetical signaling networks. The 
network revealed interconnected functional 
clusters including chemokine-cytokine inter- 
actions and B and T cell signaling (Fig. 1B). 
Notably, the aging-dependent T cell responses 
coincided with a major increase in the number 
of T cell receptor (TCR) clonotypes (fig. S1E). 
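
The FDR < 0.05 threshold used to call differentially expressed genes can be illustrated with a minimal sketch. The text here does not state which multiple-testing correction was applied, so the Benjamini-Hochberg procedure below is an assumption, and the gene names and P values are hypothetical placeholders rather than data from this study.

```python
def benjamini_hochberg(pvalues):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(pvalues)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: pvalues[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so that adjusted values
    # never increase as the raw P value decreases.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, pvalues[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def significant_genes(gene_pvalues, fdr=0.05):
    """Keep genes whose adjusted P value falls below the FDR threshold."""
    genes = list(gene_pvalues)
    qvalues = benjamini_hochberg([gene_pvalues[g] for g in genes])
    return {g: q for g, q in zip(genes, qvalues) if q < fdr}


# Hypothetical raw P values for five placeholder genes (not study data).
toy_pvalues = {
    "gene_a": 0.0001,
    "gene_b": 0.004,
    "gene_c": 0.02,
    "gene_d": 0.3,
    "gene_e": 0.8,
}
hits = significant_genes(toy_pvalues)
print(sorted(hits))  # gene_a, gene_b, and gene_c pass the 0.05 threshold
```

In practice this filtering would normally be done by the differential expression package itself; the sketch only shows what an FDR cutoff means for a list of raw P values.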


Neuronal CXCL13 is elevated in aged DRG 
along with CD8⁺ T cells expressing CXCR5 


Cxcl13 was the most prominently up-regulated 
gene after sciatic injury (Fig. 1C and data S3). 
The expression of the CXCL13 receptor Cxcr5 
was also significantly enhanced in aged DRG 
(Fig. 1C and data S3), suggesting the presence 
of a CXCL13-CXCR5 signaling axis. A signifi- 
cant increase in CXCL13 expression was also 
observed in aged sciatic DRG neurons both at 
baseline and 3 days after SNI, when regener- 
ation is reduced relative to young DRG neu- 
rons (Fig. 1, D to H, and fig. S1, F to H). Although 
Cxcl13 expression was also detected around 
the sciatic nerve injury site, no aging-dependent 
changes were observed (fig. S1, I and J). CXCL13 
protein in DRG 3 days after SNI also significantly 
increased in aged mice relative 
to young mice (Fig. 1I). 

Because CXCL13 is a chemoattractant for 
CXCR5⁺ B and T cells, we next assessed the 
localization and immunophenotype of B and 
T cells within the DRG 3 days after SNI in 
young or aged mice. B and T cells including 
CXCR5⁺ B cells, CXCR5⁺CD8⁺ T cells, and CD4⁺ 
T cells significantly increased in aged DRG 
parenchyma (Fig. 2, A to D, and fig. S2, A to F). 

A considerable percentage of CXCR5⁺CD8⁺ 
T cells were CD44⁺CD69⁺CD62L(low), indicating 
effector memory T cells (Fig. 2, E and F, and 
fig. S2D), whereas only a very small proportion 
of CD8⁺ T cells or CXCR5⁺CD8⁺ T cells were 
CD103⁺ and P2RX7⁺, suggesting a migratory 
rather than a primary tissue-resident phenotype. 
Approximately half of both CD8⁺ and 




RESEARCH | RESEARCH ARTICLE 
[Fig. 1 graphic. Recoverable labels: (A) heatmap of GO biological processes, with rows including immune defense response, response to wounding, adaptive immune response, lymphocyte mediated immunity, antigen processing and presentation, B cell mediated immunity, regulation of T cell activation, innate immune response, cytokine-mediated signaling pathway, chemotaxis, neurotransmitter transport, myelination, axonogenesis, neuron development, ion homeostasis, and ion transport; (B) network legend: chemokines-cytokines, T cells, MHC classes, B cell, complement, transcription factors, signaling molecules, others; (C) heatmap columns SNI Young and SNI Aged; (E) x axis: distance from crush site (mm); (G) groups Sham-Young, Sham-Aged, SNI-Young, SNI-Aged; (H) CXCL13 expression; (I) CXCL13 ELISA.] 


Fig. 1. Aging induces enrichment in genes associated with chemokine and 
cytokine expression and adaptive immunity in dorsal root ganglia, including 
after SNI. (A) Heatmap shows the top 15 categories of Gene Ontology Biological 
Processes (GO-BPs) of the differentially expressed genes (FDR < 0.05) from three 
comparisons: SNI young, sham aged, and SNI aged versus sham young, respectively 
(n = 3). GO-BP enrichment of up-regulated (red) and down-regulated (blue) genes is 
indicated by −log10 P (P < 0.01). (B) Cytoscape visualization of the STRING protein-protein 
interaction network of the up-regulated genes (FDR < 0.05) involved in 
immune response in SNI aged versus sham young (P < 0.01). Each node is a gene, 
color-coded according to STRING functional annotation. (C) Heatmap shows the fold 
change of the differentially expressed chemokines and cytokines. (D) Representative 
SCG10 immunostaining of sciatic nerves 3 days after SNI. The dashed line indicates 
the proximal crush site. Scale bar, 500 μm. (E) Normalized SCG10 intensity to 
the proximal crush site (n = 5). (F) Analysis of the regeneration index (n = 5). 
(G) Representative immunostaining of CXCL13, the neuronal marker Tuj1, and DAPI in 
DRG 3 days after SNI. Scale bar, 50 μm. (H) Quantification of CXCL13 expression 
shown in (G) (n = 4). (I) CXCL13 ELISA from sciatic DRG (n = 3). **P < 0.01, ****P < 
0.0001 [two-way ANOVA with post hoc Sidak's test in (E); unpaired Student's t test 
in (F); one-way ANOVA with post hoc Tukey's test in (H) and (I)]. Data are means ± SEM 
and represent three [(A), (C), and (I)] or two [(E), (F), and (H)] experiments. 
Fig. 2. CXCR5⁺CD8⁺ T cell characterization in DRG before and after SNI. 
(A) Quantification of B and T cell number normalized to the counting beads in 
DRG (n = 3). (B) Percentage of B and T cells of total CD45⁺ cells in DRG 
(n = 3). (C) Number of CXCR5⁺CD8⁺ or CD4⁺ T cells normalized to the counting 
beads in DRG (n = 3). (D) Percentage of CXCR5⁺CD8⁺ or CD4⁺ T cells in DRG 
(n = 3). (E) Number of CD62L(low), CD44⁺, or CD69⁺/CXCR5⁺CD8⁺ T cells 
normalized to the counting beads in DRG (n = 3). (F) Percentage of CD62L(low), 
CD44⁺, or CD69⁺/CXCR5⁺CD8⁺ T cells in DRG (n = 3). (G) CD8 coimmunostained 
with CD3 and DAPI or CXCR5 in DRG 3 days after SNI. Scale bar, 10 μm. 
(H) Quantification of CXCR5⁺CD8⁺ T cell number and percentage of CXCR5⁺CD8⁺ 
T cells of total CD8⁺ T cells in DRG in sham or 3 days after SNI (n = 3). 
*P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 [unpaired Student's t test 
in (A) to (F) or one-way ANOVA with post hoc Tukey's test in (H)]; n.s., not 
significant. Data are means ± SEM and represent three experiments. 


CXCR5⁺CD8⁺ T cells also expressed CXCR6, 
suggesting cell retention and restricted recirculation 
ability (fig. S3). 

In aged animals, the percentage of naive 
total T cells in the blood was reduced, whereas 
the percentage of memory CXCR5⁺ T cells was 
significantly enhanced (fig. S4). The number 
of CD8⁺ T cells, including CXCR5⁺CD8⁺ T cells, 
significantly increased in aged sciatic DRG 
relative to young at baseline, with a further 
significant increase 3 days after SNI (Fig. 2, G 
and H). All CD8⁺ cells were CD68⁻ and CD3⁺, 
indicating that they were T cells and not macrophages 
(fig. S5A). Finally, B cell numbers in 
aged DRG after SNI increased, but these cells 




localized outside of the DRG parenchyma, 
likely in the perivenular space (fig. S5, B and 
C). Thus, aging is associated with enhanced 
neuronal expression of CXCL13 and recruitment 
of CXCR5⁺ active and resident T cells to 
the DRG parenchyma before and after SNI. 


Neuronal CXCL13 is sufficient to recruit 
CXCR5⁺ B and T cells to the DRG and to 
impair axonal regeneration 


Next, we asked whether CXCL13 was secreted 
and whether its overexpression was sufficient 
to recruit CXCR5⁺ B and T cells to the DRG. 
We first transduced cultured DRG neurons with 
adeno-associated virus (AAV) to overexpress 
CXCL13, and culture medium was collected 
for the subsequent in vitro splenocyte migration 
assays. Medium from CXCL13-overexpressing 
cultures enhanced the recruitment of CXCR5⁺ 
B and T cells relative to control medium (fig. S6). 

To address whether CXCL13 expression 
would affect axonal regeneration, we overexpressed 
CXCL13 in vivo in DRG neurons, 
and interferon-γ (IFN-γ) and mannitol were 
delivered systemically to induce MHC I presentation 
and enhance blood-DRG barrier permeability, 
respectively. Immediately thereafter, 
a SNI was performed (fig. S7). Both CXCR5⁺ B cells 
and CXCR5⁺CD8⁺ T cells were significantly 
increased in Cxcl13-overexpressing DRG relative 
to green fluorescent protein (GFP) control 
(Fig. 3, A to F). Sciatic nerve axonal regeneration 
was significantly reduced after Cxcl13 overexpression 
(Fig. 3, G to I). 

Thus, CXCL13 expressed by DRG neurons 
can recruit CXCR5⁺ B and T cells and can inhibit 
axonal regeneration, partially phenocopying 
the aging phenotype. 


CXCL13 expression in aged DRG requires 
lymphotoxin-dependent NF-κB phosphorylation 


Fig. 3. Overexpression of CXCL13 in DRG neurons in young mice facilitates 
the recruitment of CXCR5⁺ B and T cells to the DRG and reduces sciatic 
nerve regeneration. (A) Flow cytometry analysis of CXCR5⁺CD8⁺ T cells in 
AAV-GFP- or AAV-Cxcl13-infected sciatic DRG 3 days after SNI. (B) Quantification 
of CXCR5⁺CD8⁺ T cell number normalized to the counting beads in the sciatic DRG 
(n = 3). (C) Percentage of CXCR5⁺CD8⁺ T cells in sciatic DRG (n = 3). 
(D) Percentage of CXCR5⁺ B cells in sciatic DRG (n = 3). (E) Number of 
CXCR5⁺CD4⁺ T cells in sciatic DRG normalized to the counting beads (n = 3). 
(F) Percentage of CXCR5⁺CD4⁺ T cells in sciatic DRG (n = 3). (G) Representative 
SCG10 immunostaining of sciatic nerves from AAV-GFP- or AAV-Cxcl13-infected 
mice. The dashed line indicates the proximal injury site. Scale bar, 500 μm. 
(H) Normalized SCG10 intensity to the proximal crush site from the mice infected 
with AAV-GFP and AAV-Cxcl13 (n = 6). (I) Analysis of the regeneration index 
(n = 6). *P < 0.05, ***P < 0.001 [unpaired Student's t test in (B), (C), (D), (E), 
(F), and (I) or two-way ANOVA with post hoc Sidak's test in (H)]. Data are means ± 
SEM and represent three experiments. 

To identify the transcription factors (TFs) putatively 
involved in driving aging-dependent 
CXCL13 expression, we asked whether TFs 
previously reported to promote CXCL13 expression 
were induced in aged DRG in our 
RNA-seq dataset. Among the TFs known to 
bind to the CXCL13 promoter, including Hif1a, 
Pou5f1 (Oct4), Rar, Nfkb, and Spi (10, 11), only 
Nfkb2 was significantly up-regulated in aged 
DRG, which made it a candidate for further 
investigation (fig. S8A). We next treated cultured 
DRG neurons with canonical NF-κB activators 
[tumor necrosis factor-α (TNF-α) or 
interleukin-1β (IL-1β)] and noncanonical NF-κB 
activators (BAFF or LTα1β2) for 24 hours, followed 
by reverse transcription polymerase chain 
reaction (RT-PCR) for Cxcl13, Nfkb1, and Nfkb2. 
TNF-α and IL-1β were able to up-regulate the 
expression of Nfkb1 and/or Nfkb2 but not Cxcl13, 
whereas BAFF affected neither Nfkb nor Cxcl13 
gene expression (fig. S8B). 

However, LTα1β2, which is known to activate 
noncanonical NF-κB2 (12), significantly 
enhanced the expression of both Nfkb2 and 
Cxcl13. This effect could be attenuated via the 
selective inhibition of the NF-κB kinase IKK 
using the IKK inhibitor PS1145 (fig. S8C). Both 
NF-κB2 expression and phosphorylation were 
induced with aging, but phosphorylation of 
NF-κB2 was inhibited by PS1145 relative to a 
vehicle control (fig. S8D). Accordingly, aging-induced 
expression of Cxcl13 was significantly 
decreased by PS1145 (fig. S8E). Finally, immunostaining 
for pNF-κB2 demonstrated neuronal 
localization within the DRG and a significant 
increase in aged DRG neurons both before 
and 3 days after SNI (fig. S8, F and G). Thus, 
age-related expression of CXCL13 is driven 
by NF-κB2. 


CD8⁺ T cells and neuronal MHC I are required 
for aging-dependent regenerative decline after 
sciatic nerve injury 


Next, we asked whether CD8⁺ T cells, CD4⁺ 
T cells, or B cells are required to impair axonal 
regeneration of aged sciatic DRG neurons. 
When CD8⁺ T cells were depleted using a CD8α 
monoclonal antibody (mAb) administered to 
aged mice 1 week before SNI, we observed significantly 
improved nerve regeneration (Fig. 4, 
A to C) and a significant reduction of CD8⁺ 
T cells in the DRG relative to animals that 
received an isotype control (fig. S9). By contrast, 
the depletion of CD4⁺ T or B cells did not 
alter aging-dependent regenerative decline after 
SNI (fig. S10). 

These data implicate CD8⁺ T cells in the 
regenerative decline that follows SNI in aged 
animals. RNA-seq revealed an aging-dependent 
enrichment in the expression of MHC I after 
SNI (Fig. 1B). MHC I expression increased 
before and after SNI in aged DRG neurons 
relative to young controls (Fig. 4, D to F). MHC I 
expression did not show any selectivity for 
neuronal subtypes (fig. S11, A to C). However, 
MHC I was not expressed in axons, but rather 
was restricted to macrophages in proximity to 
the sciatic nerve lesion site (fig. S11D). 

Next, we asked whether antigen-sensitized 
CD8⁺ T cells were required to limit neurite outgrowth 
in MHC I-expressing DRG neurons in 
culture. MHC I was induced in DRG neurons 
but absent in glial cells after IFN-γ injection 
(fig. S12, A to D). TCR-transgenic OT-I CD8⁺ 
T cells reduced neurite outgrowth in 
DRG neurons expressing MHC I in the presence 
of the ovalbumin (OVA) peptide OVA257-264 
(fig. S12, E and F) and increased 
cleaved caspase 3 without an increase in apoptotic 
features (fig. S12, G and H). The purity of 
isolated CD8⁺ T cells from the spleen of OT-I 
mice was confirmed by flow cytometry (fig. 
S12I). Thus, the presence of both neuronal 
MHC I expression and OT-I antigen-specific 
CD8⁺ T cells is needed to reduce neurite outgrowth 
in DRG neurons. 

We next explored whether MHC I-dependent 
peptide presentation was needed for axonal 
regenerative decline in aging mice. To this 
end, we used AAV to conditionally express the 
virally encoded glycine/alanine-rich peptide 
sequence GAr, which inhibits MHC I antigen 
presentation, thereby evading CD8⁺ T cell immune 
responses (13, 14), in sciatic DRG of aged 
mice (fig. S13A). A sciatic nerve crush injury 
was then performed and animals were killed 
3 days later (fig. S13B). Neuronal MHC I and 
cleaved caspase 3 expression were significant- 
ly reduced after AAV-GAr-rtTA infection rela- 
tive to control AAV-rtTA (Fig. 4, G and H). 
Sciatic axonal regeneration was significantly 
enhanced upon AAV-GAr-rtTA infection (Fig. 
4, I and J, and fig. S13, C to E). Thus, both CD8⁺ 
T cells and neuronal MHC I are required for 
aging-dependent regenerative decline after SNI. 


CD8⁺ T cell-dependent activation of caspase 3 
inhibits pAKT and pS6 and is required for 
aging-dependent axonal regenerative failure 


We aimed to characterize the molecular signaling 
underpinning CD8⁺ T cell-dependent 
regenerative decline in aged animals. Cleaved 
caspase 3 expression significantly increased in 
DRG neurons of aged mice 3 days after SNI 
(Fig. 4, K and L) in the absence of DNA frag- 
mentation associated with apoptosis (fig. S14A). 
Perforin and granzyme B expression signif- 
icantly increased in aged DRG neurons after 
SNI (fig. S14, B to D). Active caspase 3 can 
lead to AKT truncation and reduced AKT phosphorylation 
(15), which in turn reduces S6 
phosphorylation. pAKT and pS6 are associated 
with axonal regeneration (16, 17) and 
are typically impaired by aging (18). Indeed, 
phosphorylation of AKT and S6 decreased in 
aged DRG 3 days after SNI. However, this effect 
was significantly reversed after depletion 
of CD8⁺ T cells (Fig. 4, M and N). CD8⁺ T cell 
depletion significantly enhanced pAKT and pS6 
in DRG neurons while reducing the expression 
of cleaved caspase 3, perforin, and granzyme B 
(fig. S14, E to G, and fig. S15, A and B). 
Finally, the inhibition of caspase 3 with 
Z-DEVD-FMK significantly induced AKT and 
S6 phosphorylation (fig. S15, C and D) and 
increased sciatic axon regeneration (fig. S15, 
E to G). Thus, CD8⁺ T cell-dependent activation 
of caspase 3 is required to inhibit pAKT 
and pS6 in aged DRG neurons after SNI. More- 
over, the inhibition of caspase 3 activation pro- 
motes axonal regeneration. 


CXCR5⁺CD8⁺ T cells drive aging-dependent 
axonal regenerative decline after sciatic 
nerve injury 


Next, we sought to determine whether CXCR5⁺CD8⁺ 
T cells are directly responsible for axonal 
regenerative decline after SNI. CXCR5⁺CD8⁺ 
T cells were isolated and adoptively transferred 
to young or aged OT-I Rag2−/− mice, 
which lack B and T cells (fig. S16A). Prior to 
cell transfer, we found a significant increase 
in nerve regeneration in aged OT-I Rag2−/− 
mice with and without CD8 T cell depletion 
relative to wild-type mice, with no change between 
the two OT-I Rag2−/− groups (fig. S16, B 
to E). Thus, endogenous OT-I CD8⁺ T cells had 
no effect on nerve regeneration. 

We then asked whether adoptive transfer of 
CXCR5⁺CD8⁺ T cells was sufficient to restrict 
sciatic nerve regeneration in aged mice after 
SNI. Sorted CXCR5⁺CD8⁺ T cells (fig. S17, A 
and B) were injected into young or aged OT-I 
Rag2−/− recipients. Adoptively transferred 
CXCR5⁺CD8⁺ T cells robustly infiltrated the DRG 
of aged, but not young, recipients (Fig. 5, A and B). 
By contrast, transferred CXCR5⁺CD8⁺ T cells 
accumulated more abundantly in spleens of 
young mice versus aged mice (Fig. 5, A and B). 
These migratory patterns of CXCR5⁺CD8⁺ 
T cells correlated with the differential CXCL13 
levels in the DRG and spleens (Fig. 5C). Additionally, 
transferred CXCR5⁺CD8⁺ T cells 
were increased in the blood of aged mice (fig. 
S17, C and D). Finally, adoptive transfer of 
CXCR5⁺CD8⁺ T cells significantly reduced 
sciatic nerve regeneration after injury 
only in aged mice (Fig. 5, D and E, and fig. 
S17E). CXCR5⁺CD8⁺ T cell transfer resulted 
in a significant increase of perforin and a 
decrease of pS6 expression in aged DRG neu- 
rons (Fig. 5, F and G), whereas no changes 
were noted in the DRG of young mice (fig. 
S17, F to I). 


CXCL13 antagonism blocks the recruitment 
of CXCR5⁺CD8⁺ T cells to rescue 
aging-dependent axonal regenerative 
decline after sciatic nerve injury 


Having demonstrated that recruitment of 
CXCR5⁺CD8⁺ T cells causes regeneration failure, 
we asked how these cells are recruited. 
Sorted wild-type or Cxcr5−/− CD8⁺ T cells were 
transferred to OT-I Rag2−/− recipient mice that 
were treated with control immunoglobulin G 
(IgG) or CXCL13 mAb before SNI. Cxcr5−/− 
CD8⁺ T cell recruitment to the DRG was significantly 
lower than for wild-type CD8⁺ T cells. 
CXCL13 mAb significantly reduced the migration 
Fig. 4. CD8⁺ T cells and neuronal MHC I are required for age-dependent 
regenerative decline after SNI through pAKT and pS6. (A) Representative 
SCG10 immunostaining of sciatic nerves 3 days after SNI after control IgG or 
CD8 mAb. The dashed line indicates the proximal crush site. Scale bar, 500 μm. 
(B) Normalized SCG10 intensity to the proximal crush site (n = 6). (C) Analysis 
of regeneration index (n = 6). (D) Representative MHC I and Tuj1 coimmunostaining 
and DAPI in sciatic DRG before and 3 days after SNI. Scale bar, 50 μm. (E) Fold change 
of MHC I expression shown in (D) (n = 3). (F) Percentage of antigen-presenting 
(AP) DRG neurons (n = 3). (G) Representative MHC I or cleaved caspase 3 
(cl-cas3) coimmunostaining with luciferase (Lu) and DAPI in DRG after 
AAV-rtTA or AAV-GAr-rtTA and 3 days after SNI. Scale bar, 50 μm. (H) Fold 
change of MHC I and cleaved caspase 3 expression in DRG shown in (G) 
(n = 6). (I) SCG10 immunostaining of sciatic nerves from AAV-rtTA- or 
AAV-GAr-rtTA-infected aged mice 3 days after SNI. The dashed line indicates 
the proximal injury site. Scale bar, 500 μm. (J) Normalized SCG10 intensity 
to the proximal crush site (n = 6). (K) Representative coimmunostaining 
of cleaved caspase 3, Tuj1, and DAPI in DRG before and 3 days after SNI. Scale 
bar, 50 μm. (L) Quantification of the percentage of cleaved caspase 3-positive 
DRG neurons (n = 4). (M) Immunoblot shows p-AKT, AKT, p-S6, S6, and Tuj1 
expression in the sciatic DRG after control IgG or CD8 mAb 3 days after SNI. 
(N) Immunoblot quantification of p-AKT/AKT and p-S6/S6 in the sciatic DRG 
(n = 4). *P < 0.05, **P < 0.01, ***P < 0.001, ****P < 0.0001 [two-way ANOVA 
with post hoc Sidak's test in (B) and (J); unpaired Student's t test in (C) 
and (H); one-way ANOVA with post hoc Tukey's test in (E), (F), (L), and (N)]. 
Data are means ± SEM and represent two [(B), (C), (L), and (N)] or three 
[(E), (F), (H), and (J)] experiments. 
Fig. 5. CXCR5⁺CD8⁺ T cells cause aging-dependent regenerative decline 
after SNI. (A) Representative flow cytometry plots of endogenous and 
transferred CXCR5⁺CD8⁺ T cells (gated TCRβ⁺CD8⁺ cells) from the sciatic DRG 
and spleen of OT-I Rag2−/− mice 3 days after SNI. Tetramer (SIINFEKL) was 
used to distinguish the host OT-I CD8 T cells from donated wild-type CD8 T cells. 
(B) Quantification of transferred CXCR5⁺CD8⁺ T cells from DRG or spleen 3 days 
after SNI (n = 3). (C) CXCL13 ELISA from OT-I Rag2−/− DRG or spleen (n = 3 
or 4). (D) Representative SCG10 immunostaining of sciatic nerves with or 
without CXCR5⁺CD8⁺ T cell transfer 3 days after SNI in OT-I Rag2−/− mice. The 
dashed line indicates the proximal injury site. Scale bar, 500 μm. (E) Analysis 
of regeneration index (n = 6 or 8). (F) Perforin or p-S6 counterstained with 
Tuj1 in the DRG from aged OT-I Rag2−/− mice 3 days after SNI with or without 
CXCR5⁺CD8⁺ T cell transfer. Scale bar, 50 μm. (G) Quantification of perforin 
or p-S6 normalized intensity (n = 3). *P < 0.05, **P < 0.01, ***P < 0.001 [one-way 
ANOVA with post hoc Tukey's test in (E); unpaired Student's t test in (B), 
(C), and (G)]. Data are means ± SEM and represent three experiments. 


of CXCR5⁺CD8⁺ cells to DRG (Fig. 6, A to E) and 
the spleen (Fig. 6, I and J). Sciatic nerve regeneration 
was significantly enhanced by CXCL13 
antagonism after adoptive transfer of wild-type 
CD8⁺ T cells (Fig. 6, F to H), but not of 
Cxcr5−/− CD8⁺ T cells, which were also associated 
with higher regeneration relative to wild-type 
CD8⁺ T cells (Fig. 6, F to H). Thus, CXCL13 
acting on its receptor CXCR5 is required for 
CD8 T cell migration and to restrict sciatic 
nerve regeneration in aged DRG. 


CXCL13 neutralization reverses 
aging-dependent regenerative decline 
and promotes neurological functional 
recovery after sciatic nerve injury 


Finally, we investigated whether CXCL13 neutralization 
promotes axonal regeneration, epidermal 
innervation, and functional recovery 
in aged animals. Injection of CXCL13 mAb significantly 
enhanced neurite outgrowth in DRG 
ex vivo culture, reduced the recruitment of 
CXCR5⁺ T and B cells, and blocked aging-dependent 
regenerative failure of sciatic nerve 
and DRG infiltration of CD8 T cells 3 days 
after SNI in aged mice but had no effect in 
young mice (Fig. 7, A to E, and fig. S18, A to F). 
However, CXCL13 neutralization did not alter 
the number of CD11b⁺F4/80⁺ macrophages in 
DRG (fig. S19). 

In a separate experiment, we tested the ef- 
fects of CXCL13 antagonism on functional 
recovery. Aged mice recovered significantly 
more slowly than young mice, and CXCL13 
neutralization induced a significantly accel- 
erated recovery in aged mice in all the sensory 
modalities investigated (Fig. 7, F to I, and fig. 
S18, G to I). CXCL13 neutralization promoted 
significant skin reinnervation from the hair- 
less interdigital area of the hindpaw in aged 
mice (Fig. 7, J and K, and fig. S18, J and K). 
Thus, CXCL13 neutralization promotes axonal 
regeneration, skin reinnervation, and functional 
recovery in aged animals. 


Discussion 


Our work indicates that aging primes DRG 
neurons for regenerative failure after axonal 
injury by a CXCL13-dependent mechanism that 
regulates CXCR5+CD8+ T cell entry into the
DRG. Regenerative failure is a consequence
of the CXCL13-dependent DRG recruitment of
CXCR5+CD8+ T cells that communicate with
DRG neurons via MHC I on the neuronal cell 
surface to drive regenerative inhibitory signal- 
ing. Unlike in the aged DRG tissue micro- 
environment, young adult DRG do not harbor 
T cells before an injury and the T cells do not 
colonize the DRG parenchyma until many days 
after nerve injury, past the initiation of the 
axonal regenerative response (19). CXCL13 neutralization reduced the
number of CXCR5+CD8+ T cells in the DRG, allowing for significantly
enhanced axonal regeneration and sensory recovery.

Fig. 6. CXCR5 is required for CD8 T cell infiltration in aged DRG and
regenerative decline after SNI. (A) Representative flow cytometry plots of
endogenous and transferred CXCR5+CD8+ T cells (gated TCRβ+CD8+ cells) from
the sciatic DRG of OT-I Rag2−/− aged mice 3 days after SNI and treatment with
IgG or CXCL13 mAb. (B) Quantification of transferred CXCR5+CD8+ T cells from
DRG shown in (A) (n = 3 or 4). (C) Quantification of transferred CD8+ T cells
from DRG (n = 3 or 4). (D) Representative coimmunostaining of CD8 and CXCR5
in sciatic DRG 3 days after SNI. Scale bar, 10 μm. (E) Quantification of the
number of CXCR5+CD8+ T cells in sciatic DRG (n = 4). (F) Representative SCG10
immunostaining of sciatic nerves after control IgG or CXCL13 mAb treatment
3 days after SNI. The dashed line indicates the proximal injury site. Scale
bar, 500 μm. (G) Analysis of regeneration index (n = 6 or 8). (H) Normalized
SCG10 intensity to the proximal crush site (n = 6 or 8). *P < 0.05,
**P < 0.01, ***P < 0.001, ****P < 0.0001 (wild-type & anti-CXCL13 versus
wild-type & IgG); †P < 0.05, ††P < 0.01 (CXCR5−/− & IgG versus wild-type &
IgG); #P < 0.05, ##P < 0.01 (CXCR5−/− & anti-CXCL13 versus wild-type & IgG).
(I) Representative flow cytometry plots of endogenous and transferred CD8+
T cells (gated TCRβ+CD8+ cells) from splenocytes from OT-I Rag2−/− aged mice
3 days after SNI with control IgG or CXCL13 mAb. (J) Percentage of
transferred CD8+ T cells from spleen (n = 3 or 4). (K) Plots showing
wild-type or Cxcr5−/− CD8+ T cells after sorting. *P < 0.05, **P < 0.01,
***P < 0.001, ****P < 0.0001 [one-way ANOVA with post hoc Tukey's test in
(B), (C), (E), (G), (H), and (J)]. Data are means ± SEM and represent four
experiments.

Fig. 7. CXCL13 neutralization enhances axonal regeneration, epidermal
reinnervation, and sensory functional recovery after SNI in aged mice.
(A) Representative SCG10 immunostaining of sciatic nerves after control IgG
or CXCL13 mAb treatment 3 days after SNI. The dashed line indicates the
proximal injury site. Scale bar, 500 μm. (B) Normalized SCG10 intensity
(n = 6, *P < 0.05, **P < 0.01, IgG-Y versus IgG-A; †P < 0.05, ††P < 0.01,
anti-CXCL13-Y versus IgG-A; #P < 0.05, anti-CXCL13-A versus IgG-A).
(C) Analysis of regeneration index (n = 6). (D) Representative flow cytometry
plots of CXCR5+ and CD8+ T cells (gated CD45+TCRβ+CD19− cells) from the
sciatic DRG of aged mice 3 days after SNI. (E) Quantification of CXCR5+ B,
CD8+, and CD4+ T cell number from DRG 3 days after SNI (n = 3). (F and G)
Mechanical sensitivity by von Frey test (F) and thermosensitivity analysis by
Hargreaves test (G) after SNI. **P < 0.01, ***P < 0.001, ****P < 0.0001
(IgG-Y versus IgG-A); †P < 0.05, ††††P < 0.0001 (anti-CXCL13-Y versus IgG-A);
##P < 0.01, ####P < 0.0001 (anti-CXCL13-A versus IgG-A) (n = 10 or 11;
two-way ANOVA with post hoc Sidak's test). (H and I) Sensory responses by
adhesive tape contact (H) and removal (I) test after SNI (n = 10 or 11).
(J) Representative coimmunostaining of PGP9.5 and DAPI shows the epidermal
innervation of the hindpaw interdigital skin 18 days after SNI. The dashed
lines indicate the boundary of the epidermis and dermis. Scale bar, 50 μm.
(K) Quantification of the number of intra-epidermal nerve fibers (IENF) per
mm of interdigital skin (n = 6). *P < 0.05, **P < 0.01, ***P < 0.001,
****P < 0.0001 [two-way ANOVA with post hoc Sidak's test in (B), (F), (G),
(H), and (I); one-way ANOVA with post hoc Tukey's test in (C) and (K);
unpaired Student's t test in (E)]. Data are means ± SEM and represent three
[(B), (C), and (E)] or two [(F), (G), (H), (I), and (K)] experiments.

CXCL13 is typically expressed in lymphoid organs to control the
recruitment of B cells and antigen-presenting cells (20). The CXCL13
receptor CXCR5 is highly expressed by a subset of B cells and by
CD4+ T cells that are attracted to CXCL13-rich sites such as the
lymphoid tissue. CXCL13 has also been found to be expressed by
astrocytes and DRG neurons regulating pain (21, 22). CXCR5+CD8+
T cells have been described as follicular effector memory T cells
and have been implicated in the orchestration of the follicular T and
B cell responses to infection (23, 24). Clonally expanded CD8+ T cells
are present in neurogenic niches in the aged mouse brain, where their
activity is required to inhibit neural stem cell proliferation (25).
However, the role of CD8+ T cells in the nervous system can be either
neurotoxic or neuroprotective, depending on
the specific experimental conditions (26-28).

We also observed that CXCR5+ B cells are attracted by CXCL13
gradients toward the DRG in aged animals and after CXCL13
overexpression, but remain trapped in the perivenular space. Unlike
CXCR5+CD8+ T cells, CXCR5+ B cells do not migrate into the
parenchyma and do not affect neuronal regenerative ability.
Whether CXCL13 and CXCR5+ B and T cells populate the perivenular
spaces or the parenchyma in the aged brain and whether they may
contribute to tissue damage remains unexplored. 

CXCR5+CD8+ T cells are recruited to and progressively
accumulate in the DRG, where they
reside until they act as effector T cells upon
MHC I antigen presentation by injured neu- 
rons. Although the nature of the aging and 
injury-dependent antigens remains elusive, 
we speculate that the combination of injury- 
dependent antigens with viral peptides accu- 
mulated and processed during the lifespan 
may lead to T cell-dependent neuronal MHC I 
recognition and inhibition of axonal regener- 
ation after injury. 

Neuronal MHC I has been implicated in 
synaptic maintenance of sciatic motor neu- 
rons and locomotor function after sciatic 
nerve or spinal cord injury in young animals 
(29, 30). However, its role in axonal regener- 
ation of aged sensory neurons had not been 
explored. Here, we have shown that the inhibitory signaling
limiting axonal regeneration in aged DRG neurons appears to rely on
neuronal MHC I–CD8+ T cell interactions that elicit intracellular
signals such as perforin, granzyme B, and caspase 3 activation (31–33).
Active caspase 3 is needed to reduce pAKT, likely by AKT cleavage (15),
resulting in a decrease in downstream signals. Depletion of
CD8+ T cells or inhibition of caspase 3 activation counteracts
aging-dependent regenerative decline and results in a significant
increase in both pAKT and pS6. These are strongly developmentally
regulated signals within the regenerative PI3K and mTOR pathways (17).
Thus, the induction of these pathways in 
aged DRG neurons is likely to be crucial to 
promote regenerative signaling and merits 
further exploration. 

In line with our data, CXCL13 expression 
was previously shown to be reduced in the 
spleens and lymph nodes of aged mice relative 
to young mice (34, 35), leading to reduced ac- 
cumulation of B and T cells. The CXCL13- 
dependent shift of B cells and T cells from 
secondary lymphoid organs to peripheral tis- 
sues may be a general phenomenon leading to 
tissue damage beyond the peripheral nervous 
system. 

The delivery of CXCL13 mAbs that promote 
axonal regeneration, skin reinnervation, and 
recovery of neurological function represents 
a translational opportunity for improving re- 
pair after injury. CXCL13 neutralization may 
reduce CXCR5+ follicular helper B cell and
T cell homing, with a risk of increased susceptibility
to viral infection. However, the CXCL13 mAb used here is
currently in preclinical development and has been effective in
ameliorating pathology and disability in ani- 
mal models of rheumatoid arthritis and multi- 
ple sclerosis without obvious toxicity (36). Our 
results therefore suggest that CXCL13 neutral- 
ization may be considered for the treatment 
of axonal injuries affecting the elderly. More 
generally, they describe aging-specific regen- 
erative mechanisms, suggesting the need for 
targeted interventions that consider age at 
the time of injury or disease initiation. 


Materials and methods 

Mice 

Wild-type 8- to 10-week-old C57BL/6J young 
mice were obtained from Charles River, whereas 
20- to 22-month-old aged mice were supplied 
by either Charles River or by M. Jucker (Hertie 
Institute for Clinical Brain Research, University
Hospital Tübingen, Germany). OT-I Rag2−/−
mice on a C57BL/6 genetic background were
provided by M. Botto (Imperial College London).
Cxcr5−/− mice on a C57BL/6 genetic background
were donated by T. Arnon (University of
Oxford). All animal procedures were performed 
in accordance with the UK Animals Scientific 
Procedures Act (1986) and approved by the 
ethical committee of Imperial College London. 


DRG cell culture 


Sterile 15-mm glass coverslips in 24-well plates were coated with
0.003% poly-L-ornithine (Sigma) diluted in H2O for 1 hour at 37°C.
After three washes with water, the coverslips were coated with
laminin (1 μg/ml; Millipore) prepared in PBS (Invitrogen) for 3 hours
at room temperature. After two washes with PBS, the wells were filled
with DMEM and placed at 37°C until cells were plated. DRGs were
dissected and collected in Hank's balanced salt solution (HBSS;
Invitrogen) on ice. HBSS was then removed and replaced with 500 μl of
digest solution [Dispase II (5 mg/ml, Sigma) and collagenase type II
(2.5 mg/ml, Worthington)] in DMEM (Invitrogen) for 40 min in a 37°C
water bath with gentle tube flicks every 5 min. DRG were collected by
centrifugation at 72g for 2 min after digestion and washed once with
warm DRG medium [DMEM/F12 (Invitrogen), 1x B27 (Invitrogen), and 10%
fetal bovine serum (FBS; Sigma)]. DRG were then resuspended in 1 ml of
warm DRG medium and dissociated by pipetting 15 times with a
fire-polished, Sigmacote (Sigma)-coated glass pipette. Undissociated
tissue was filtered with a 100-μm cell strainer (BD Falcon) and cells
were counted under a bright-field microscope with a hemocytometer.
After cell counting, cells were centrifuged, resuspended with warm DRG
culture medium [DMEM/F12 (Invitrogen), 1x B27 (Invitrogen), and 1%
penicillin and streptomycin (Invitrogen)] and plated (4000 cells per
well). The DRG cells were cultured at 37°C in 5% CO2 for 24 hours.


DRG neurite outgrowth analysis 


The average neurite length of cultured DRG 
cells was measured by using Neurolucida soft- 
ware (MBF Bioscience) with at least 100 cells 
(five fields per coverslip randomly selected) per 
condition in technical and biological triplicate. 


Immunocytochemistry 


Cultured DRG cells were fixed with ice-cold 
4% paraformaldehyde (PFA; Sigma) in 1x PBS 
for 30 min on ice and washed with 1x PBS 
three times. Cells were blocked in 1x PBS, 0.1% 
Triton X-100 (Sigma), and 5% normal goat 
serum (Abcam) at room temperature for 
1 hour followed by the incubation with pri- 
mary antibodies in 1x PBS, 0.1% Triton X-100, 
and 2% normal goat serum overnight at 4°C. 
Cells were then washed three times with 1x 
PBS and incubated with secondary antibodies 
in 1x PBS, 0.1% Triton X-100, and 2% normal 
goat serum for 2 hours at room temperature 
in the dark. Cells were washed three times 
with 1x PBS and incubated with 4′,6-diamidino-2-phenylindole
(DAPI, ThermoFisher, 62248,
0.2 μg/ml) in 1x PBS, 0.1% Triton X-100 for
10 min followed by two washes with 1x PBS. 
The coverslips with cells facing down were 
transferred and mounted with Antifade mount- 
ing medium (Vectashield, H-1000) on the slides. 
Detailed antibody information is provided in 
table S1. 


Immunohistochemistry 


Mice were anesthetized by ketamine-xylazine 
mixture (80 mg of ketamine and 10 mg of 
xylazine per kilogram of body weight) via 
intraperitoneal (i.p.) injection and transcardially perfused with
20 ml of ice-cold PBS followed by 4% PFA at a flow rate of 5 ml/min.
DRG, sciatic nerves, or glabrous skin of hindpaws were excised and
post-fixed in 4% PFA for 2 hours on ice and transferred to 30% sucrose
(Sigma) for 3 days prior to embedding in OCT compound (Tissue-Tek).
DRG were sectioned at 10 μm, sciatic nerves at 12 μm, and interdigital
footpads at 30 μm perpendicularly to the skin surface with a cryostat
(Leica). Sections were treated with a blocking solution containing 10%
normal goat serum or normal donkey serum (Abcam) and 0.3% Triton X-100
in PBS for 1 hour at room temperature and stained with primary
antibodies at 4°C overnight. Sections were washed with 1x PBS three
times followed by secondary antibody incubation at room temperature
for 2 hours. Sections were then stained with DAPI (ThermoFisher,
62248, 0.2 μg/ml) for 15 min at room temperature followed by three
washes with 1x PBS and then mounted with Antifade mounting medium
(Vectashield, H-1000). DRG were imaged with a Nikon Eclipse TE2000
microscope using 20x magnification (DIC Plan-Apochromat VC objective
and QImaging OptiMOS detection system). Sciatic nerves (20x
magnification, HC Plan-Apochromat CS2) and epidermis (40x
magnification with oil lens, HC Plan-Apochromat CS2) were imaged with
a Leica TCS SP8 confocal laser scanning microscope with a Leica
DFC9000 GTC detection system. Detailed antibody information is
provided in table S1.


Sciatic nerve crush injury 


Sciatic nerve was crushed following the pro- 
tocol previously described (37). Mice were an- 
esthetized with 2% isoflurane in oxygen (1 liter/ 
min) and injected with buprenorphine (0.1 mg/ 
kg body weight) and rimadyl (5 mg/kg body 
weight) subcutaneously (s.c.) for analgesia. An 
incision across the mid-thigh was made in the 
skin, the sciatic nerve was exposed by opening 
the fascial plane, and a small deeper incision 
was made between the gluteus maximus and 
the anterior head of the biceps femoris. The 
proximal side of exposed nerve where the 
three fascicles are sequentially aligned was 
placed on the bottom jaw of a super-fine he- 
mostatic forceps (Fine Science Tools, 13020-12) 
1.5 mm from their tip. The nerve was crushed 
for 30 s at three clicks of the forceps with no 
nerve stretch. The skin was then closed with 
suture clips. Mice were administered bupre- 
norphine (0.1 mg/kg body weight) twice per 
day and rimadyl (5 mg/kg body weight) daily 
by s.c. injection for 3 days post-operation. 
Sham surgery was performed to expose the 
nerves without injury. 


Retrograde tracing after nerve injury 


Mice were anesthetized following the protocol 
of sciatic nerve crush injury. The skin and mus- 
cle at the distal thigh were opened to expose the 
nerve and 1 μl of 1% Alexa Fluor 555-conjugated
cholera toxin subunit B (CTB, ThermoFisher)
diluted in sterile saline was injected in the nerve 
8 mm distal to the injury site with Hamilton 
syringe and needle (NDL small RN ga34/ 
15mm/pst45°) (Hamilton) as described (38). 
The skin was closed with suture clips. 


RNA preparation and sequencing 


RNA sequencing was performed using RNA 
from the whole sciatic DRG tissue of young or 
aged mice 24 hours after sham or sciatic nerve 
injury. Specifically, sciatic DRG were dissected 
from one mouse per sample and stored in 
RNAlater stabilization solution (ThermoFisher). 
Tissue was homogenized with RNase free 
micro pestle and RNA was extracted using 
RNeasy kit (Qiagen) following the manufac- 
turer’s protocol. On-column DNase digestion 
was performed with the RNase-free DNase set 
(Qiagen) for 15 min to remove DNA. RNA con- 
centrations and purity were measured using 
Agilent 2100 Bioanalyzer (Agilent). RNA with 
RNA integrity number (RIN) above 7.5 was 
used for library preparation. Libraries were 
prepared at Ospedale San Raffaele (Milan) 
using the TruSeq mRNA Sample Preparation
kit (Illumina) and sequenced using Illumina
HiSeq 2500 100-cycle, paired-end sequencing.


Bioinformatic analysis 


RNA sequencing data analysis was performed 
as published (39). P values were corrected for
false discovery rate (FDR) using the Benjamini-Hochberg (BH)
procedure. DE genes (FDR < 0.05) of three
comparisons (SNI Young, Sham Aged, and 
SNI Aged versus Sham Young, respectively) 
were analyzed for Gene Ontology (GO) and 
KEGG pathways using DAVID v6.7 (https:// 
david-d.ncifcrf.gov/) with all expressed genes 
in our dataset as the background. The catego- 
ries of GO and KEGG pathways were selected 
by P < 0.01 for both up- and down-regulation. 
Area proportional Venn diagrams were gen- 
erated using Biovenn (www.biovenn.nl/). The 
differentially up-regulated genes from SNI 
Aged versus Sham Young involved in immune 
response collected from GO and KEGG path- 
ways were analyzed for protein-protein network 
established by STRING (https://string-db.org/). 
The stringent criteria were selected using the 
active interaction sources of experiments and 
databases, as well as the highest confidence of 
the minimum required interaction score. The 
network was visualized by Cytoscape v3.7.2 
(https://cytoscape.org/). For the generation 
of Circos plots of VJ segment usage, we used 
MiXCR (v3.0.13) to identify T cell receptor 
clonotypes from whole DRG RNA-seq data. 
MiXCR’s “analyze shotgun” method was run 
directly on the fastq files from the RNA-seq 
experiment using default parameters for mouse 
RNA to identify clonotypes. The T cell receptor clonotypes
were then pooled among replicates and imported to VDJtools
v1.2.1. The “PlotFancyVJUsage” method was used to generate
the Circos plot separately for each biological condition,
using default parameters.
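
As an illustration of the Benjamini-Hochberg step mentioned above, the following is a minimal sketch and not the published pipeline (39). The function names and the toy P values are hypothetical; the only element taken from the text is the FDR < 0.05 cutoff.

```python
def benjamini_hochberg(p_values):
    """Return Benjamini-Hochberg adjusted P values in the input order."""
    m = len(p_values)
    # Indices of the P values sorted from smallest to largest.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest P value down so adjusted values stay monotone.
    for rank in range(m, 0, -1):
        idx = order[rank - 1]
        running_min = min(running_min, p_values[idx] * m / rank)
        adjusted[idx] = running_min
    return adjusted


def select_de_genes(gene_p_values, fdr_threshold=0.05):
    """Keep genes whose adjusted P value falls below the FDR threshold."""
    genes = list(gene_p_values)
    adjusted = benjamini_hochberg([gene_p_values[g] for g in genes])
    return [g for g, q in zip(genes, adjusted) if q < fdr_threshold]


if __name__ == "__main__":
    # Hypothetical P values, for illustration only.
    toy = {"geneA": 0.001, "geneB": 0.5}
    print(select_de_genes(toy))
```

In practice a differential expression package would report adjusted P values directly; the sketch only makes the correction explicit.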


CXCL13 ELISA assay 


Sciatic DRG dissected from two young or aged 
mice per sample 3 days after sham or sciatic 
nerve injury were collected and homogenized 
in 150 μl Assay Diluent A supplied from Mouse
CXCL13 ELISA kit (ThermoFisher) with protease
inhibitors (Roche) to produce the data in
Fig. U. Total DRG and spleen were dissected
from one young or aged OT-I Rag2−/− mouse
per sample and homogenized in 150 μl and
2 ml of Assay Diluent A buffer with protease
inhibitor, respectively, to produce the data in
Fig. 5C. After homogenization, Triton X-100
was added to a final 1% concentration. Samples
were frozen in liquid nitrogen and thawed
followed by centrifugation at 10,000g for 5 min
to remove the debris. Protein concentration
was quantified using BCA protein assay kit
(ThermoFisher) and 100 μl of cell lysate was
used for the CXCL13 ELISA assay using Mouse
CXCL13 ELISA kit according to the manufacturer's
protocols. CXCL13 ELISA measurements
were obtained by measuring absorbance on an 
ELISA plate reader set at 450 nm and sub- 
tracting 550-nm values to correct for optical 
imperfections in the microplate. CXCL13 con- 
centrations in the cell lysate were calculated 
according to the standard curve. 
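
The readout described above (450-nm absorbance minus the 550-nm value, then conversion through a standard curve) can be sketched as follows. This is an illustration under stated assumptions: the piecewise-linear interpolation is a simplification chosen here, the text does not specify the curve-fitting model, and all names and numbers are hypothetical.

```python
def corrected_absorbance(a450, a550):
    """Subtract the 550-nm reading to correct for plate imperfections."""
    return a450 - a550


def interpolate_concentration(absorbance, standards):
    """Estimate concentration from (concentration, absorbance) standards.

    Assumption: linear interpolation between neighbouring standards.
    A kit protocol may call for a different fit (for example 4-parameter).
    """
    points = sorted(standards, key=lambda s: s[1])
    for (c0, a0), (c1, a1) in zip(points, points[1:]):
        if a0 <= absorbance <= a1:
            if a1 == a0:
                return c0
            fraction = (absorbance - a0) / (a1 - a0)
            return c0 + fraction * (c1 - c0)
    raise ValueError("absorbance outside the range of the standard curve")


if __name__ == "__main__":
    # Hypothetical standards: (pg/ml, corrected absorbance).
    curve = [(0.0, 0.05), (100.0, 0.55), (200.0, 1.05)]
    sample = corrected_absorbance(0.40, 0.10)
    print(interpolate_concentration(sample, curve))
```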


Quantification of immunostaining 


Arbitrary fluorescence intensity was measured 
with ImageJ software (NIH) and normalized 
fluorescence intensity was calculated by sub- 
tracting the intensity of the background im- 
munofluorescent signal. The relative fold change 
was calculated by comparing the normalized 
fluorescence intensity of the experimental 
group versus the control group. The cells des- 
ignated positive for the expression of cleaved 
caspase 3, perforin, and granzyme B showed 
fluorescence intensity at least twofold above 
background. Longitudinal sections of sciatic 
nerve stained with SCG10 were analyzed for 
axonal regeneration. SCG10 arbitrary inten- 
sity was measured along the nerve and per- 
centage of intensity in every 0.5 mm was 
quantified away from the proximal site of 
crush. A regeneration index was evaluated 
by calculating the distance away from the 
proximal injury site where the intensity 
was reduced to 50% level with respect to the 
proximal site. For epidermal and dermal re- 
innervation, the number of IENF indicated by 
PGP9.5 staining from sagittal sections of the 
skin was measured per unit of volume. Ani- 
mals were randomly and equally grouped. 
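
The two calculations described above, background-subtracted fold change and the regeneration index (distance at which SCG10 intensity falls to 50% of the proximal value), can be sketched as below. This is a hedged illustration rather than the authors' ImageJ workflow: the linear interpolation between 0.5-mm bins and all function names and sample values are assumptions.

```python
def fold_change_vs_control(exp_signal, exp_background,
                           ctrl_signal, ctrl_background):
    """Background-subtracted intensity of the experimental group
    relative to the control group."""
    return (exp_signal - exp_background) / (ctrl_signal - ctrl_background)


def regeneration_index(distances_mm, intensities, fraction=0.5):
    """Distance from the proximal site where intensity drops to `fraction`
    of the proximal value. Returns None if it never drops that far.

    Assumption: linear interpolation between adjacent measurement bins.
    """
    if len(distances_mm) != len(intensities) or not intensities:
        raise ValueError("need matching, non-empty distance and intensity lists")
    reference = intensities[0]
    if reference <= 0:
        raise ValueError("proximal intensity must be positive")
    normalized = [value / reference for value in intensities]
    for i in range(1, len(normalized)):
        previous, current = normalized[i - 1], normalized[i]
        if previous > fraction >= current:
            span = distances_mm[i] - distances_mm[i - 1]
            return distances_mm[i - 1] + span * (previous - fraction) / (previous - current)
    return None


if __name__ == "__main__":
    # Hypothetical intensity profile sampled every 0.5 mm.
    print(regeneration_index([0.0, 0.5, 1.0, 1.5], [200.0, 160.0, 80.0, 40.0]))
```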


Immunoblotting 


Sciatic DRG were dissected and collected in 
1x PBS with protease inhibitor (Roche) and
phosphatase inhibitor (Roche). Tissues were
then homogenized, and protein lysate was extracted
by RIPA buffer. Protein concentration
was measured with BCA protein assay kit
(ThermoFisher) and 20 μg of protein was
loaded onto a 10% SDS-PAGE gel and transferred
onto the nitrocellulose membrane of an iBlot Transfer
Stack (ThermoFisher) by iBlot 2 Gel Transfer
Device (ThermoFisher) under the default P0
program. The membrane was blocked with 5%
milk in TBST buffer for 1 hour at room tem- 
perature (RT) and incubated with primary anti- 
bodies at 4°C overnight. The membranes were 
then washed with TBST and incubated with 
horseradish peroxidase (HRP)-conjugated sec- 
ondary antibodies at RT for 1 hour followed by 
chemiluminescent detection with Amersham 
Hyperfilm (GE Healthcare Life Sciences) after 
incubation with ECL substrate (ThermoFisher). 
Detailed antibody information is provided in 
table S1. 


Flow cytometry 


For cell characterization in vivo, young or aged 
naive or sciatic nerve-injured animals were 
injected with 3 μg of APC-conjugated anti-CD45
(I3/2.3, Biolegend) intravenously (i.v.)
for 3 min. Peripheral blood or DRG were col- 
lected in 10 ml of 1x PBS with 2 mM EDTA and 
RPMI-1640 medium (ThermoFisher), respec- 
tively, after anesthesia. Blood cells were col- 
lected by centrifugation at 300g for 5 min at 
RT and erythrocytes were lysed using 1x RBC 
lysis buffer (Biolegend). Cells were washed 
with 1x PBS three times and filtered by 70-μm
cell strainer (Corning) to remove the clumps 
and resuspended by fresh FACS buffer (0.5% 
BSA and 2 mM EDTA in PBS) for staining. 
DRG were washed by 1x PBS once and dissociated
with digestion buffer [CLSPA (0.5 mg/ml,
Worthington, #LS005273) and DNase I
(7.5 μg/ml, Roche, #10104159001) in RPMI-1640]
at 37°C for 30 min. After digestion, DRG tissues
were dispersed and filtered through a
70-μm cell strainer to collect the single-cell
suspension and washed with rinse buffer
(5% FBS in RPMI-1640). After centrifugation,
the pellet was resuspended with 1 ml of DNase
solution [250 μg/ml of DNase I in 1x DNase
buffer (1.21 mg/ml of Tris Base, 0.5 mg/ml of
MgCl2, and 73 μg/ml of CaCl2 in RPMI-1640)]
at RT for 30 min. Cells were washed with 
FACS buffer for staining. 

Isolated cells from the blood and DRG samples
were treated with 1.0 μg of TruStain FcX
anti-mouse CD16/32 antibody (Biolegend) per
10⁶ cells in a 100-μl volume to block Fc receptors
for 10 min on ice and stained with primary
antibodies in FACS buffer with Brilliant
Stain Buffer (BD Horizon). LIVE/DEAD Fix- 
able Aqua Dead Cell Stain Kit (ThermoFisher) 
was also used to exclude the dead cells from 
staining. Cells were stained with the cocktail 
antibodies at 4°C in the dark for 20 min and 


Zhou et al., Science 376, eabd5926 (2022) 


13 May 2022 


stained with PE-streptavidin (2 ug/ml, Biolegend) 
at 4°C for 20 min after washing. Fluorescence 
Minus One (FMO) and PE-streptavidin stain- 
ing controls were used for the gating bounda- 
ries. To minimize anti-CXCR5 shedding before 
cell acquisition, stained cells were immediately 
fixed with IC fixation buffer (ThermoFisher) for 
20 min at RT. Cells were washed with 1x PBS 
and resuspended with 180 μl of FACS buffer.
Precision Counting Beads (20 μl; Biolegend,
424902) were added to each sample before 
cell acquisition by flow cytometer LSR II (BD 
Biosciences). All DRG cells and 5000 blood 
cells or splenocytes were acquired for quan- 
tification. Detailed antibody information is 
provided in table S1. 


NF-κB and Cxcl13 expression in cultured
DRG neurons


Isolated DRG primary cells were plated with cytokines and incubated at
37°C in 5% CO2 for 24 hours. The following compounds were used in cell
culture: recombinant mouse TNF-α (410-MT, R&D Systems, 500 pg/ml),
recombinant mouse IL-1β (401-ML, R&D Systems, 600 pg/ml), recombinant
mouse BAFF (8876-BF, R&D Systems, 100 ng/ml), and recombinant mouse
lymphotoxin α1/β2 (9968-LY, R&D Systems, 100 ng/ml). Selective IKK
inhibitor (10 μM; PS1145, TOCRIS, #4569) was in turn used with or
without lymphotoxin α1/β2 stimulation for 24 hours. Total RNA from
cultured cells was extracted using RNeasy kit (Qiagen) according to
manufacturer's guidelines. cDNA was then synthesized with SuperScript
II Reverse Transcriptase (ThermoFisher). qRT-PCR was performed using
KAPA SYBR FAST qPCR kit (Sigma, KK4601) on an Applied Biosystems 7900
Real-Time PCR System. Ct values normalized versus Gapdh as a
housekeeping gene were exported and relative fold change of gene
expression compared to vehicle control was calculated by 2^(−ΔΔCt).
Primers used in qPCR were as follows:
Nfkb1 forward 5'-GAAATTCCTGATCCAGACAAAAAC-3' and
reverse 5'-ATCACTTCAATGGCCTCTGTGTAG-3';
Nfkb2 forward 5'-GGCCGGAAGACCTATCCTACT-3' and
reverse 5'-CTACAGACACAGCGCACACT-3';
Cxcl13 forward 5'-CTCTCCAGGCCACGGTATT-3' and
reverse 5'-TAACCATTTGGCACGAGGAT-3';
Gapdh forward 5'-TCAACAGCAACTCCCACTCTTCCA-3' and
reverse 5'-ACCCTGTTGCTGTAGCCGTATTCA-3'.
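
For readers less familiar with the comparative Ct method referred to above, a minimal sketch of the calculation follows. The function name and the Ct values are hypothetical; the only elements taken from the text are Gapdh as the reference gene and vehicle as the control condition.

```python
def relative_fold_change(ct_target_treated, ct_ref_treated,
                         ct_target_control, ct_ref_control):
    """Relative expression by the comparative Ct method.

    delta Ct = Ct(target) - Ct(reference gene, here Gapdh)
    delta-delta Ct = delta Ct(treated) - delta Ct(vehicle control)
    fold change = 2 ** (-delta-delta Ct)
    """
    delta_treated = ct_target_treated - ct_ref_treated
    delta_control = ct_target_control - ct_ref_control
    return 2.0 ** -(delta_treated - delta_control)


if __name__ == "__main__":
    # Hypothetical Ct values: target gene in treated versus vehicle wells.
    print(relative_fold_change(22.0, 18.0, 24.0, 18.0))
```

The method assumes roughly equal amplification efficiency for the target and reference primer pairs.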


Cell migration in culture after 
CXCL13 overexpression 


AAV5-GFP (Vector Biolabs, #7073) or AAV5-Cxcl13-GFP (AAV-256398) was
added (1 μl per well) to DRG primary cell culture (8000 cells per well
in 24-well plates) for a 5-day incubation. Thereafter, culture medium
was collected and centrifuged at 300g for 5 min and 200 μl of culture
medium was transferred to each receiving well of Corning HTS Transwell
96-well permeable supports, 5.0-μm pore size (Sigma, CLS3387). A fresh
wild-type mouse spleen was mashed on a 70-μm cell strainer and
collected cells were treated with RBC lysis buffer to remove the
erythrocytes. Purified splenocytes after PBS washing were incubated
with 250 μg/ml of DNase I for 30 min at RT. After DNA
digestion and PBS washing, cell numbers were 
counted and adjusted to the density of 5 x 10° 
cells per 100 μl by DRG culture medium. Cell
suspension (100 μl per well) was carefully
loaded to the transwell, which was then placed 
onto the receiving plate to make sure the 
liquid from each well was in contact with the 
transwell. The plate was incubated at 37°C for 
4 hours. After migration, the transwell was 
removed and the medium in the receiving 
plate was collected. The medium was pooled 
from six wells as one sample. Cells were 
stained and acquired by flow cytometer and 
three replicates per group were measured for 
quantification. 

To validate the successful overexpression or 
secretion of CXCL13 in cell culture, total RNA 
was extracted for cDNA synthesis followed by 
Cxcl13 qRT-PCR and DNA electrophoresis, or
culture medium was collected for CXCL13 assay
using mouse CXCL13 ELISA kit as described
above. 


In vivo DRG neuron CXCL13 overexpression 


Eight-week-old mice were anesthetized with 2% 
isoflurane in 1 liter/min oxygen and admin- 
istered with buprenorphine (0.1 mg/kg body 
weight) and rimadyl (5 mg/kg body weight) 
by s.c. injection. The skin and muscle at the 
distal thigh were opened to expose the nerve, 
and 2 μl of AAV-GFP or AAV-CXCL13-GFP
were injected in the nerve bilaterally at the
distal site of the thigh with Hamilton syringe
and needle (NDL small RN ga34/15mm/pst45°)
(Hamilton). The skin was closed with suture
clips. Animals were monitored for 6 weeks
after CXCL13 overexpression. Recombinant
mouse IFN-γ (10 μg, Biolegend, #575308) and
25% D-mannitol (200 μl, Sigma, M4125) were i.p.
and i.v. injected, respectively, into each mouse
2 days before sciatic nerve crush. Sciatic DRG 
were collected for tissue dissociation with 
CLSPA and cells were then stained with LIVE/ 
DEAD-Aqua and cell markers for flow cytom- 
etry. Sciatic DRG and nerves were collected 
and fixed for the measurement of gene over- 
expression and axonal regeneration. 


Cell depletion experiments 


To deplete CD8+ or CD4+ T cells in vivo, 22- to 24-month-old mice were
i.p. injected with 200 μg of anti-mouse CD8a (YTS 169.4, BioXcell,
BE0117) or anti-mouse CD4 (YTS 191, BioXcell, BE0119). An equivalent
amount of rat IgG2b (LTF-2, BioXcell, BE0090) was injected into the
control group of animals. Depletion of B cells was performed following
a strategy described previously (40). Aged mice were i.p. injected with
antibody cocktail consisting of 150 μg of InvivoMAb anti-mouse CD19
(1D3, BioXcell, BE0150), 150 μg of InvivoMAb anti-mouse B220
(RA3.3A1/6.1, BioXcell, BE0067), and 150 μg of InvivoMAb anti-mouse
CD22 (Cy34.1, BioXcell, BE0011). Forty-eight hours later, the mice were
injected with a secondary antibody InvivoMAb anti-rat kappa (MAR18.5,
BioXcell, BE0122), at a dose of 150 μg per mouse. Rat IgG2a (2A3,
BioXcell, BE0089), polyclonal rat IgG (BioXcell, BE0094), mouse IgG1
(MOPC-21, BioXcell, BE0083), and mouse IgG2a (C1.18.4, BioXcell,
BE0085) isotype controls were injected in the same dosage. All
injections were performed twice 1 week prior 
to the sciatic nerve injury and once more the 
day after injury. 


DRG neuron and OT-I CD8+ T cell coculture 


Eight-week-old wild-type mice were injected 
with PBS or 10 μg of recombinant mouse IFN-γ
2 days before being killed for DRG dissection
and DRG neuron culture. Naive OT-I CD8+
T cells were purified from 8-week-old OT-I mice
using the MojoSort Mouse CD8 T Cell Isolation
kit (Biolegend). Purified live CD8+ T cells
were counted by Trypan Blue (ThermoFisher)
staining. Approximately 4000 DRG neurons
per well were plated with 10,000 OT-I CD8+
T cells with or without OVA257–264 peptide
(100 nM, SIINFEKL, InvivoGen) at 37°C for 
24 hours. The culture medium was then re- 
moved and the cells were fixed with 4% PFA. 
This was followed by two washes with 1x PBS 
for immunostaining. 


GAr-rtTA–mediated immune evasion 


AAV5-rtTA or AAV5-GAr-rtTA viral particles, 
which were provided by J. Verhaagen (Netherlands 
Institute for Neuroscience), were mixed with 
the viral particles expressing doxycycline- 
inducible luciferase (AAV5-dox-i-luciferase) in 
a 1:1 ratio to be titer-matched by PBS+5% 
sucrose dilution. Viral mix (3 ul) was injected 
into the sciatic nerve of aged mice bilaterally. 
Five weeks later, doxycycline (Stemcell, #72742) 
reconstituted in PBS was delivered by daily i.p. 
injection at a dosage of 50 mg/kg body weight 
for 1 week before sciatic nerve injury and 3 days 
after injury. Upon killing, sciatic DRG and 
nerves were dissected for immunostaining. 


CXCL13 neutralization 


CXCL13 mAb (5378) and control IgG2a mAb
(2510) were kindly supplied by Vaccinex Inc.
(Rochester, NY). For DRG ex vivo culture, young
and aged mice were treated with control IgG
or CXCL13 mAb (30 mg/kg body weight, i.p.)
three times 1 week before DRG dissection and
primary cell culture. Isolated DRG cells were
incubated at 37°C in 5% CO2 for 24 hours,
followed by neurite outgrowth measurement.
For CXCL13 neutralization in vivo, control IgG 


Zhou et al., Science 376, eabd5926 (2022) 


13 May 2022 


or CXCL13 mAb (30 mg/kg body weight) was
injected i.p. into the young or aged mice
4 hours after sciatic nerve crush and daily
until killing on day 3. For the chronic nerve
regeneration study, 30 mg/kg body weight
control IgG or CXCL13 mAb was injected i.p.
three times per week, beginning 4 hours after
injury, until the animals were killed.


Adoptive cell transfer 


Spleens from wild-type or Cxcr5−/− mice were
harvested. Splenocytes were extracted after red
blood cell lysis using RBC lysis buffer (BioLegend,
420302). CD8+ T cells were then purified
with a mouse CD8+ T cell isolation kit
(Miltenyi Biotec, 130-104-075) and LS columns
(Miltenyi Biotec, 130-042-401). Wild-type CD8+
T cells or Cxcr5−/− CD8+ T cells (2 x 10’) were
injected i.v. into each aged OT-I Rag2−/− mouse
(20 to 22 months old), and a sciatic nerve crush
was performed 3 days after cell transfer. Control
IgG or CXCL13 mAb (30 mg/kg body
weight) was injected i.p. into the recipients
immediately after cell transfer and daily until
killing on day 3 after nerve injury. For wild-type
CXCR5+CD8+ T cell transfer, purified
CD8+ T cells from spleen and lymph nodes of
adult wild-type mice (4 to 6 months old) were
stained with LIVE/DEAD, anti-CD8, anti-TCRβ, and
biotin-conjugated anti-CXCR5 antibodies for
20 min, followed by washing and staining with
APC-streptavidin. Live TCRβ+CD8+CXCR5+
cells were sorted with a BD FACSAria III flow
cytometer. Cells (5 x 10°, sorted from spleen
and lymph nodes of ~10 donors) per recipient
were adoptively transferred to the young (8 to
10 weeks old) and aged OT-I Rag2−/− mice (20 to
22 months old). Control groups were injected
with an equivalent amount of PBS.
Three days later, a sciatic nerve crush was per- 
formed, and the animals were killed 3 days 
after nerve injury. DRG, spleen, and blood 
samples were then collected after a 3-min 
labeling of blood-exposed cells by i.v. injection 
of PE-Cy7-conjugated anti-CD45. Isolated DRG 
cells, splenocytes, and blood cells were stained 
with LIVE/DEAD, primary antibodies, and tetramer
(SIINFEKL) for 30 min at room temperature.
The cells were washed and stained with
APC-streptavidin (2 μg/ml) for 20 min at 4°C.
Cell samples were acquired on an LSR II flow
cytometer (BD Biosciences). Detailed antibody
information is provided in table S1.


In vivo administration of IKK and 
caspase 3 inhibitors 


PS1145 (TOCRIS, #4569) was reconstituted with 
0.5% methyl cellulose (Sigma), 0.2% Tween-80 
(Sigma) in water. The suspension was vortexed 
for 2 min and sonicated at 37°C for 20 min 
(LabSonic, B. Braun Biotech International; 
0.5-mm-diameter sonotrode, three times per 
5 s, amplitude 80%). The methyl cellulose so- 
lution or PS1145 suspension was delivered at 


50 mg/kg body weight by intragastric gavage 
immediately after sciatic nerve crush and at 
day 2. Specific inhibition of caspase 3 in vivo 
was performed by i.v. delivery of Z-DEVD-FMK
(Santa Cruz, #210344-95-9) in 1% DMSO at a
dosage of 20 mg/kg body weight daily after 
sciatic nerve injury. 


Behavioral assessment 


Mice with sciatic nerve crush and chronic
administration (i.p. injection) of control IgG or
CXCL13 mAb (30 mg/kg body weight, three
times per week) were used for behavioral
assessments. The left sciatic nerve was injured,
and the left hindpaw was behaviorally assessed
at day 0 and from day 7 to day 42, followed
by tissue collection at day 43. Animals
were randomly and equally grouped. All
investigators participating in behavioral
assessments were blinded to the groups and the
treatments (41).


Mechanical sensitivity test 


Mice were placed in the test chamber for 
30 min to acclimate before initiating the be- 
havioral assessment. Mechanical sensitivity 
was determined by probing the plantar surface
of the left hindpaw with calibrated von
Frey filaments ranging from 0.4 to 4 g. To give
the sensory receptors enough time to return to 
baseline, the interval time was at least 30 s 
between each trial. A quick hindpaw with- 
drawal was considered a positive response. Five 
trials were conducted for each mouse. The 
withdrawal threshold to the mechanical stim- 
ulation was defined as the lowest force that 
resulted in at least three withdrawals in five 
trials, as described (42). 
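
The threshold rule described above can be sketched in a few lines. This is a minimal illustration, assuming each filament's five trial outcomes are recorded as booleans; the function name, the data layout, and the example values are hypothetical and are not taken from the original protocol.

```python
from typing import Dict, List, Optional


def withdrawal_threshold(responses: Dict[float, List[bool]],
                         min_positive: int = 3) -> Optional[float]:
    """Return the lowest filament force (g) with >= min_positive withdrawals.

    `responses` maps each filament force to its trial outcomes
    (True = quick hindpaw withdrawal). Returns None if no force qualifies.
    """
    for force in sorted(responses):
        if sum(responses[force]) >= min_positive:
            return force
    return None


# Hypothetical example data, not measurements from the study.
example = {
    0.4: [False, False, True, False, False],
    1.0: [True, False, True, False, False],
    2.0: [True, True, False, True, False],
    4.0: [True, True, True, True, True],
}
print(withdrawal_threshold(example))
```

With these made-up responses, the 2-g filament is the lowest force that reaches three withdrawals in five trials.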


Thermal sensitivity test 


For thermal sensitivity, the Hargreaves test 
(Ugo Basile, catalog number 37370) was per- 
formed as described (43, 44). Briefly, mice 
were placed on a glass floor and separated by 
plastic chambers. Mice were given 30 min to 
acclimate. The thermal heat stimuli (infrared 
radiation source, intensity = 50) were carefully 
placed under the plantar surface of the hind- 
paw for no more than 10 s (44). The hindpaw 
withdrawal time was recorded automatically. 
Five trials were conducted on each paw and 
the average was calculated including the lon- 
gest and shortest withdrawal time. 


Adhesive removal 


Small adhesive stimuli (6-mm round adhesive 
labels) were placed on the left hindpaw of the 
mouse. The time to make first contact with 
both forepaws or mouth, as well as the time 
to completely remove the adhesive, were then 
recorded. Daily 10-min training sessions per 
mouse were performed for 1 week prior to the 
nerve injury. Each mouse was subjected to 
three trials. All testing was performed in the 
animal’s home cage by a blinded investigator as
described (39, 45).


Statistical analysis 


Data were statistically analyzed with GraphPad
Prism 9.3 (GraphPad Software Inc., La Jolla,
CA) and expressed as mean ± SEM unless
otherwise stated. Statistical comparisons were
analyzed using an unpaired two-tailed Student’s
t test for two groups. Multiple groups were
analyzed using one-way analysis of variance
(ANOVA) with post hoc Tukey’s correction or
two-way ANOVA with post hoc Sidak’s test.
Statistical significance was considered as P <
0.05. All experiments were replicated two or
three times. The n defining biological replicates
was always determined by power analysis
with power set at 80% and significance
level at 5%.
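
As an illustration of a power analysis at 80% power and a 5% significance level, the sketch below estimates the group size for a two-sided, two-group comparison. It uses the textbook normal approximation, which is an assumption here: the text does not state which calculation or which effect sizes were used, so the function name and the effect-size inputs are hypothetical.

```python
from math import ceil
from statistics import NormalDist


def n_per_group(effect_size: float, power: float = 0.80,
                alpha: float = 0.05) -> int:
    """Approximate n per group for a two-sided, two-sample comparison.

    Normal approximation: n = 2 * ((z_{1-alpha/2} + z_{power}) / d)^2,
    where d is the standardized effect size (an assumed input).
    """
    z = NormalDist()
    z_alpha = z.inv_cdf(1 - alpha / 2)
    z_power = z.inv_cdf(power)
    return ceil(2 * ((z_alpha + z_power) / effect_size) ** 2)


# Assumed effect sizes, chosen only for illustration.
print(n_per_group(1.0), n_per_group(0.5))
```

An exact t-test-based calculation gives slightly larger group sizes for small n, so this approximation should be read as a lower bound.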
QUANTUM SIMULATION


Quantum gas microscopy of Kardar-Parisi-Zhang superdiffusion


David Wei¹,², Antonio Rubio-Abadal¹,²†, Bingtian Ye³, Francisco Machado³,⁴, Jack Kemp³,
Kritsana Srakaew¹,², Simon Hollerith¹,², Jun Rui¹,²‡, Sarang Gopalakrishnan⁵,⁶, Norman Y. Yao³,⁴,
Immanuel Bloch¹,²,⁷, Johannes Zeiher¹,²*


The Kardar-Parisi-Zhang (KPZ) universality class describes the coarse-grained behavior of a wealth 

of classical stochastic models. Surprisingly, KPZ universality was recently conjectured to also describe 
spin transport in the one-dimensional quantum Heisenberg model. We tested this conjecture by 
experimentally probing transport in a cold-atom quantum simulator via the relaxation of domain walls in 
spin chains of up to 50 spins. We found that domain-wall relaxation is indeed governed by the KPZ 
dynamical exponent z = 3/2 and that the occurrence of KPZ scaling requires both integrability and a 
nonabelian SU(2) symmetry. Finally, we leveraged the single-spin-sensitive detection enabled by the
quantum gas microscope to measure an observable based on spin-transport statistics. Our results yield 
a clear signature of the nonlinearity that is a hallmark of KPZ universality. 


In many-particle systems, hydrodynamics
(1, 2) naturally emerges upon coarse graining
from the microscopic equations of
motion in both classical and quantum 
systems (3-7). A celebrated example of the 
emergence of hydrodynamics is the so-called 
Kardar-Parisi-Zhang (KPZ) equation, which 
governs an abundance of disparate phenomena 
ranging from interface growth to the shapes 
of polymers, the spreading of sound waves 
in fluids, and the growth of entanglement in 
random quantum circuits (8-11). A single 
equation can describe so many distinct phys- 
ical systems because of the notion of univer- 
sality (12). Indeed, the physical properties of 
systems described by the KPZ equation follow 
scaling functions that were exactly computed 
in a milestone mathematical achievement (13).
All of these canonical KPZ systems share the 
feature that their dynamics is highly robust: 
KPZ scaling requires no particular symmetries 
and occurs in the presence of external noise (14).
Recent efforts have focused on the predic- 
tion that KPZ hydrodynamics should emerge in 


¹Max-Planck-Institut für Quantenoptik, 85748 Garching,
Germany. ²Munich Center for Quantum Science and
Technology (MCQST), 80799 Munich, Germany. ³Department
of Physics, University of California, Berkeley, CA 94720, USA.
⁴Materials Science Division, Lawrence Berkeley National
Laboratory, Berkeley, CA 94720, USA. ⁵Department of
Physics, The Pennsylvania State University, University Park,
PA 16802, USA. ⁶Department of Physics and Astronomy,
College of Staten Island, Staten Island, NY 10314, USA.
⁷Fakultät für Physik, Ludwig-Maximilians-Universität, 80799
Munich, Germany.
*Corresponding author. Email: johannes.zeiher@mpq.mpg.de
†Present address: Institut de Ciències Fotòniques (ICFO), The
Barcelona Institute of Science and Technology, 08860 Castelldefels
(Barcelona), Spain.
‡Present address: Hefei National Laboratory for Physical Sciences at
the Microscale, University of Science and Technology of China,
Hefei, Anhui 230026, China.


716 = 13 MAY 2022 » VOL 376 ISSUE 6594 


an entirely distinct setting: the spin-1/2 quantum
Heisenberg chain at infinite temperature
(15-21). The appearance of KPZ scaling in such 
a quantum spin chain is a surprise because 
the chains lack the properties shared by all 
canonical KPZ systems. In particular, noise is 
absent and the KPZ scaling is fragile—it re- 
quires integrability (6, 22-24), a feature of the 
Heisenberg spin chain that does not apply to 
canonical examples of KPZ. The very appear- 
ance of nontrivial dynamical scaling in high- 
temperature near-equilibrium states is 
unexpected: The theory of dynamical critical 
phenomena (25) does not apply because high- 
temperature states are far from any equilibrium 
critical point. Moreover, nonlinear hydrody- 
namics predicts that transport in spin chains at 
nonzero temperature should be diffusive (26). 

Notably, recent conjectures suggest that the 
emergence of KPZ hydrodynamics in quan- 
tum many-body systems could be substantially 
more general and may apply to any integrable 
model that exhibits nonabelian symmetry 
(27, 28), suggesting a fundamental difference 
from the mechanisms of canonical KPZ (27, 27-31).
To date, a full theory of KPZ hydrodynamics in 
the Heisenberg model remains elusive (26, 32). 
In light of these questions, experimental char- 
acterization of the anomalous dynamical expo- 
nents of spin transport has been the subject of 
widespread effort (33-36). However, because a 
superdiffusive exponent can arise from a num- 
ber of distinct microscopic origins (37), it is 
essential to characterize a system beyond the 
dynamical exponent to establish its hydro- 
dynamical universality class. 

In this work, we explore the superdiffusive 
dynamics of the ferromagnetic Heisenberg 
model by using a quantum gas microscope 
with single-site resolution and single-spin- 


sensitive detection in spin chains of up to 
50 spins. Our main results are threefold. 
First, we observe superdiffusive spin transport 
with the dynamical exponent z = 3/2, con- 
sistent with KPZ hydrodynamics. Second, we 
demonstrate that both integrability and non- 
abelian symmetry are essential for observing 
superdiffusion: Breaking integrability by tuning 
dimensionality restores diffusion, and breaking 
the symmetry by preparing an initial state with 
net magnetization leads to ballistic transport 
(Fig. 1A). Finally, leveraging the ability of our 
experimental setup to detect spin-resolved 
snapshots of the entire sample, we map the 
shot-to-shot dynamical fluctuations (i.e., the 
“full counting statistics”) of the magnetization. 
These fluctuations carry clear signatures of 
the intrinsic nonlinearity associated with KPZ 
hydrodynamics (38) and distinguish it from 
other potential mechanisms for superdiffusion, 
such as Lévy flights (26). 


Experimental system 


In our experiment, we probed the transport
dynamics of bosonic $^{87}$Rb atoms trapped in
an optical lattice; the atoms occupy the two
hyperfine ground states |↑⟩ = |F = 1, $m_F$ = −1⟩
and |↓⟩ = |F = 2, $m_F$ = −2⟩ (where F is
the total angular momentum and $m_F$ is the
Zeeman sublevel), and their dynamics are captured
by a two-species Bose-Hubbard model
with on-site interaction U and tunnel coupling
$\tilde{t}$. At unit filling and in the limit of
strong interactions, the direct tunneling between
lattice sites is suppressed, and spin dynamics
occur via second-order spin exchange.
The system can be mapped to the spin-1/2 XXZ
model for |↑⟩ and |↓⟩ (39, 40), and, in one
dimension, is described by the Hamiltonian


$$\hat{H} = -J \sum_j \left( \hat{S}^x_j \hat{S}^x_{j+1} + \hat{S}^y_j \hat{S}^y_{j+1} + \Delta\, \hat{S}^z_j \hat{S}^z_{j+1} \right) \qquad (1)$$


where Δ quantifies the interaction anisotropy,
$J = 4\tilde{t}^2/U$ characterizes the spin-exchange
coupling, and j denotes the lattice sites. In
our system, the atomic scattering properties
yield Δ = 1, and the system maps to the isotropic
ferromagnetic Heisenberg model (41).
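
To make the structure of Eq. 1 concrete, the following sketch applies the XXZ Hamiltonian to a product state of up and down spins on an open chain. It is an illustration only: the state encoding (+1 for up, −1 for down), the function name, and the choice of units with J = 1 are assumptions, and this is not the tensor-network code used for the simulations in the paper.

```python
from typing import Dict, Tuple

State = Tuple[int, ...]  # +1 for spin up, -1 for spin down on each site


def apply_xxz(state: State, J: float = 1.0,
              delta: float = 1.0) -> Dict[State, float]:
    """Apply H = -J * sum_j (SxSx + SySy + delta * SzSz) to a basis state.

    Returns a dict mapping basis states to amplitudes (open boundaries).
    """
    out: Dict[State, float] = {}
    for j in range(len(state) - 1):
        a, b = state[j], state[j + 1]
        # Diagonal term: Sz_j Sz_{j+1} = (a / 2) * (b / 2)
        out[state] = out.get(state, 0.0) - J * delta * a * b / 4
        if a != b:
            # SxSx + SySy = (S+S- + S-S+) / 2 exchanges antiparallel neighbors
            flipped = state[:j] + (b, a) + state[j + 2:]
            out[flipped] = out.get(flipped, 0.0) - J / 2
    return out


print(apply_xxz((1, -1)))
print(apply_xxz((1, 1)))
```

For two sites at Δ = 1, the symmetric combination of the two antiparallel states has energy J/4 − J/2 = −J/4, the same as the fully polarized state. This degeneracy is a simple consequence of the SU(2) symmetry of the isotropic model.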
We began our experiment by loading a 
spin-polarized two-dimensional (2D) degen- 
erate gas of ~2000 atoms into a square op- 
tical lattice with a spacing of a = 532 nm. We 
achieved a homogeneous box potential over 
an area of 50 sites by 22 sites by additionally 
projecting light at a wavelength of 670 nm 
with a digital micromirror device (DMD), preparing
a Mott insulator with a filling of $n_0$ =
0.93(1) in this box [see details in (41)]. Local
spin control was realized using light at a wavelength
of 787 nm on the DMD (42) to apply a
site-resolved differential light shift between |↑⟩
and |↓⟩; subsequent microwave driving allowed
for local flips of the spatially addressed spins.
Fig. 1. Hydrodynamic transport in Heisenberg chains and schematic of the experimental
system. (A) Dynamical exponents for finite-temperature Heisenberg chains. Whereas integrable
systems typically display ballistic transport (magnetized chains, δ > 0), nonintegrable systems
are generically diffusive (2D Heisenberg model, $J_\perp$ > 0). For unmagnetized Heisenberg chains,
transport is expected to fall into the KPZ universality class with a superdiffusive exponent
z = 3/2. (Inset) By measuring polarization transfer P(t) across a domain
wall, we directly observe these transport regimes: superdiffusion in the
unmagnetized case (green), ballistic transport at finite net magnetization
(blue), and diffusion in two dimensions (orange). Exponents are extracted by
fitting $P(t) \propto t^{1/z}$; for the ballistic case, we additionally fit a vertical intercept
to account for transient initial-time dynamics. Error bars denote SD of the fit.
(B) In each experimental run, we measure the spin states of a Heisenberg
chain (top) by removing one spin species (center) and imaging the atomic
site occupation (bottom). (C) The Heisenberg chains are achieved in a
2D atomic Mott insulator (analysis region depicted) with controllable
interchain coupling. Our setup allows us to prepare domain walls with
high purity η (left and middle columns) and low purity η (right). We
measure the time evolution of both |↑⟩ (top) and |↓⟩ (middle and bottom
rows) atoms to extract the polarization transfer.


Such quantum control enabled us to pre- 
pare spin domain walls (16, 17, 43, 44) by 
spatially addressing half of the system. Sub- 
sequently, we prepared high-entropy states 
by globally rotating the spins away from the 
$S^z$ axis by using a resonant microwave pulse
and then locally dephasing them by project- 
ing a site-to-site random spin-dependent po- 
tential, which we modified from shot to shot 
(41) (Fig. 1C). More precisely, our experiments 
focused on tracking spin dynamics, starting 
from a class of initial states containing a spin 
domain wall with magnetization difference 
2η in the middle of the spin chain, i.e., half of
the system has magnetization η; the other
half has magnetization −η. In the infinite-temperature
limit, η → 0, the relaxation of
such states yields linear response transport 
coefficients, as the derivative of the spin pro- 
file is precisely the dynamical spin structure 
factor (16, 17). 

To probe 1D spin dynamics in our system, we 
rapidly quenched the lattice depth along 1D 
tubes with a length of 50 sites, which suddenly 
increased the spin-exchange coupling from 
zero to J/ħ = 64(1) s⁻¹ (where ħ is Planck’s
constant divided by 2π). After tracking the
spin dynamics for up to ~45 spin-exchange
times τ = ħ/J, we removed one spin component
and measured the remaining occupation
via fluorescence imaging (Fig. 1B). 
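
The quoted exchange rate fixes the time scales of the experiment. The short calculation below converts the rate of 64 per second into the spin-exchange time and the longest evolution time of about 45 exchange times; the variable names are illustrative.

```python
J_OVER_HBAR = 64.0  # s^-1, spin-exchange rate quoted in the text

tau = 1.0 / J_OVER_HBAR      # spin-exchange time tau = hbar / J, in seconds
max_evolution = 45 * tau     # longest evolution time (~45 tau), in seconds

print(f"tau = {tau * 1e3:.1f} ms")
print(f"45 tau = {max_evolution:.2f} s")
```

The exchange time is therefore about 15.6 ms, and the longest evolution lasts roughly 0.70 s.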


Superdiffusive spin transport 


To explore the nature of anomalous spin trans- 
port in the 1D Heisenberg model, we initialize 
the spins in a high-entropy domain-wall state 
with η = 0.22(2). We characterize the subsequent
spin transport by measuring the polarization
transfer, P(t), defined as the average
total number of spins that have crossed the
domain wall by time t (41). The emergence of


SCIENCE science.org 


[Fig. 2 axes: polarization transfer P versus time t/τ (A and B); scaling parameter j/(t/τ)^(2/3) (C).]


Fig. 2. Superdiffusive spin transport in a high-temperature Heisenberg chain. (A) The polarization
transfer for a domain-wall initial state with a contrast of η = 0.22(2) grows as a power law [$P(t) \propto t^{1/z}$]
with a fitted exponent z = 1.54(7) (solid line), indicating superdiffusive transport. The experimental

data are consistent with numerical Heisenberg-model simulations (41) (dashed line). The insets show the
averaged spin profiles $2S^z_j(t)$ at times t/τ = 0, 10, 26, which are compared to simulations (dashed lines).
(B) Polarization transfer in a double-logarithmic plot. The solid lines are power-law fits with fixed exponents,
where a distinction between z = 3/2 (green) and both z = 2 (brown) and z = 1 (blue) is visible. (C) When
rescaling time by the inverse dynamical exponent, the spatial spin profiles at times t/τ = 5 to 35 (light to dark
green) collapse to a characteristic shape consistent with the integrated KPZ function. Error bars denote SEM.


hydrodynamics is characterized by the power-law
scaling of $P(t) \sim t^{1/z}$ and immediately enables
us to extract the underlying dynamical
exponent z. As depicted in Fig. 2A, the data
exhibit a superdiffusive exponent, z = 1.54(7),
consistent with KPZ scaling. By comparison,
neither a diffusive (z = 2) nor a ballistic (z = 1)
exponent accurately captures the observed dynamics
(Fig. 2B) (41). Somewhat surprisingly,
we also observe a superdiffusive exponent of
z = 1.45(5) upon changing the initial state to a
near-pure domain wall with η = 0.95(2) (fig. S8)
(26, 41, 44-46).
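
Extracting a dynamical exponent from a power law, with the polarization transfer growing as time to the power 1/z, can be sketched as a straight-line fit in log-log space. The code below is a minimal illustration with synthetic, noise-free data; the function name is hypothetical, and an unweighted log-log least-squares fit is an assumption that may differ from the fitting procedure used for the experimental data.

```python
import math
from typing import Sequence, Tuple


def fit_dynamical_exponent(times: Sequence[float],
                           transfer: Sequence[float]) -> Tuple[float, float]:
    """Least-squares fit of log P = log A + (1/z) log t; returns (z, A)."""
    xs = [math.log(t) for t in times]
    ys = [math.log(p) for p in transfer]
    n = len(xs)
    x_mean = sum(xs) / n
    y_mean = sum(ys) / n
    slope = (sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
             / sum((x - x_mean) ** 2 for x in xs))
    amplitude = math.exp(y_mean - slope * x_mean)
    return 1.0 / slope, amplitude


# Synthetic, noise-free data with z = 3/2 (not experimental values).
times = [1, 2, 5, 10, 20, 40]
transfer = [0.3 * t ** (2 / 3) for t in times]
z, amplitude = fit_dynamical_exponent(times, transfer)
print(round(z, 6), round(amplitude, 6))
```

On real data, the fit window matters: early-time transients and finite-size reflections at late times both bias the slope.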

To further explore the superdiffusive dynamics,
we investigate the spatially resolved spin profiles
at η = 0.22(2). Our experimental observations
are in quantitative agreement with simu- 
lations based on tensor-network numerical 
techniques (41, 46, 47) and conform to KPZ 
dynamics (Fig. 2A). Crucially, when appro- 
priately rescaled by the dynamical exponent, 
all of the observed spatiotemporal profiles 
collapse onto a scaling form consistent with 
the KPZ scaling function (Fig. 2C). 
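
The collapse amounts to plotting each profile against the rescaled coordinate, site index divided by time (in exchange times) to the power 1/z. The sketch below shows that rescaling for a single profile; the function name and the profile values are hypothetical and serve only to illustrate the change of variables.

```python
from typing import Dict, List, Tuple


def rescale_profile(profile: Dict[int, float], t_over_tau: float,
                    z: float = 1.5) -> List[Tuple[float, float]]:
    """Map (site j, value) to (j / (t/tau)^(1/z), value), sorted by site."""
    scale = t_over_tau ** (1.0 / z)
    return [(j / scale, value) for j, value in sorted(profile.items())]


# Hypothetical profile values, chosen only to illustrate the rescaling.
points = rescale_profile({-8: 0.1, 0: 0.0, 8: -0.1}, t_over_tau=8.0)
print(points)
```

If the dynamics obey a single scaling form, profiles taken at different times fall on one curve after this rescaling; with the wrong z, they fan out.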


Microscopic origins of superdiffusion 


To understand why the combination of inte- 
grability and nonabelian symmetry leads to 
emergent superdiffusive transport, it is instruc- 
tive to first consider the transport dynamics on 
Fig. 3. Evolution toward diffusive
transport under a breakdown of
integrability. Fitted power-law exponent
z for the spin polarization transfer at
different coupling strengths between
individual 1D chains with initial domain
walls with η = 0.9 (41). Starting from
superdiffusive transport in the purely 1D
case, z = 1.45(5), increased interchain
coupling breaks the integrability of the 
system and leads to a crossover toward 
diffusive transport, reaching z = 2.08(4) 
in the 2D case, as generically expected 
for nonintegrable systems. The inset 
depicts the normalized polarization 
Fig. 4. Ballistic spin dynamics under broken
SU(2) symmetry. (A) Averaged experimental (top)
and numerical (bottom) spin profiles $S^z_j(t)$, from
which the initial profile $S^z_j(0)$ is subtracted.
(Left) Unmagnetized low-purity domain wall,
δ = 0, η = 0.22(2) (from Fig. 2). Spin transport
results from increased spin profile width, which
scales with the superdiffusive dynamical exponent.
The solid lines indicate the position j where the spin
profile crosses $|2S^z_j(t) - \delta|/\eta = 0.4$; because the
profiles themselves are scale invariant, the position
of any $S^z$ value follows the z = 3/2 scaling. (Right)
Magnetized domain wall, δ = 0.80(1), η = 0.12(1).
At the outer edge, the contribution of magnons is
visible, transporting spin with the speed of the spin
“light cone” (dashed line), which was measured
with a quantum walk (41). Most of the spin is carried
by quasiparticles within the light cone, and thus
the width of the profile (solid line) grows faster than
in the unmagnetized case. The numerical simulation
shows a qualitatively similar behavior. At t/τ = 25 the
magnons reach the system edge and are reflected.


top of a small net magnetization background
(18-20, 48). In our experiments, this corresponds
to preparing domain walls with a
finite overall magnetization δ, i.e., half of
the system has a magnetization η + δ; the
other half has −η + δ. Stable quasiparticles
then render spin transport ballistic (Fig. 1A),
leading to a characteristic polarization-transfer
rate that scales linearly with net magnetization
δ (20). Even when δ = 0 on average,
random local fluctuations of the magnetization
will be present; thus, the net magnetization
in a typical region of size ℓ will scale as
$1/\sqrt{\ell}$. Therefore, the average spin-transport
rate across a region of size ℓ also scales as
$1/\sqrt{\ell}$, implying that the transport time across
the region scales as $\ell/(1/\sqrt{\ell}) \sim \ell^{3/2}$, precisely
yielding the KPZ exponent z = 3/2 [see details
in (41)].
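
The heuristic argument above can be written out step by step. In this sketch, ℓ is the region size, δ_ℓ the typical net magnetization of a region of that size, v(ℓ) the resulting spin-transport rate, and t_ℓ the transport time; these symbols are introduced here for illustration, and all prefactors are dropped.

```latex
\begin{align}
\delta_\ell &\sim \ell^{-1/2}
  && \text{(random fluctuations over $\ell$ sites)} \\
v(\ell) &\propto \delta_\ell \sim \ell^{-1/2}
  && \text{(ballistic rate linear in net magnetization)} \\
t_\ell &\sim \frac{\ell}{v(\ell)} \sim \ell^{3/2}
  && \Rightarrow\quad z = \tfrac{3}{2}
\end{align}
```

The last line uses the definition of the dynamical exponent, with transport time growing as length to the power z.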
(B) To extract the ballistic polarization-transfer velocity, we linearly fit the normalized polarization transfer
after a crossover time, t/τ > 16 (left). We observe transfer velocity growth as the initial domain-wall
magnetization δ increases (right, light to dark blue). Error bars denote SD of the fit.


This intuitive analysis suggests two key re- 
quirements for superdiffusive transport: (i) in- 
tegrability ensures the presence of stable 
quasiparticles that move ballistically, and (ii) 
the presence of nonabelian SU(2) symmetry 
makes the characteristic velocity of the ballistic 
contribution to spin transport vanish. We can 
experimentally probe these requirements by 
individually breaking either the integrability or 
the SU(2) symmetry of the system. 

To break integrability, we turn on a finite
interchain coupling $J_\perp$ by lowering the lattice
depth orthogonal to the 1D spin chains, which
effectively causes the system to become 2D
(49, 50). We measure the dependence of the
polarization transfer on the interchain coupling,
starting from an unmagnetized domain wall
[η = 0.9 (41), δ = 0]. As shown in Fig. 3, the
extracted dynamical exponents exhibit a clear
flow from superdiffusive transport when $J_\perp = 0$
to diffusive transport, z = 2.08(4), when
$J_\perp = J$. Notably, for $J_\perp/J < 0.1$ integrability
is strictly broken, but the transport dynamics
remain consistent with superdiffusion within
experimentally accessible time scales. This
observation bolsters recent theoretical expectations,
which suggest that superdiffusion can
be particularly robust to perturbations that do
not break the nonabelian symmetry (51).

Next, we explore the effect of breaking the 
underlying SU(2) symmetry by using initial 
states with finite net magnetization δ (41).
Working with an imbalanced domain-wall
initial state [η = 0.12(1), δ = 0.80(1)], we observe
two main differences from the unmagnetized
(δ = 0) case (Fig. 4A). First, the
polarization profile exhibits a fast ballistic
component that follows the spin “light cone”
of the dynamics (with a speed of 1/τ; dashed
line in Fig. 4A). This contribution arises from 
the fastest quasiparticles, which transport spin 
above the magnetized background (41, 52). 
Second, within this light cone, polarization 
spreads substantially faster than in the un- 
magnetized case; this comprises the bulk 
of the spin transport and is mediated by 
slower-moving, net-magnetization-carrying 
quasiparticles. 

At early times, the polarization-transfer dy- 
namics exhibit a superdiffusive power law be- 
fore crossing over to linear ballistic transport 
at later times (47). In particular, by fitting a 
power law to the late-time data, t/τ > 16, we
extract a dynamical exponent z = 0.9(3), con- 
sistent with ballistic spin transport (Fig. 4B). 
Although our results agree qualitatively with 
numerical simulations of the Heisenberg model, 
the magnitude of the measured polarization 
transfer is smaller; this can be understood as 
resulting from the presence of hole defects in 
the initial state (41, 53). In addition to verifying 
the ballistic nature of the spin dynamics, we 
can also directly extract the velocity of the 
underlying quasiparticles. By controlling the 
overall magnetization of the initial state, we 
observe the expected increase of the velocity 
with δ (Fig. 4B), an essential component for
understanding the presence of KPZ super- 
diffusion in spin chains (20). 


Observing KPZ hydrodynamics 


Our previous observations have focused on 
characterizing superdiffusive spin transport. 
However, from the perspective of observing 
KPZ universality, this is insufficient, as mul- 
tiple different classes of hydrodynamics can 
exhibit the same dynamical exponent of z = 
3/2. To distinguish these classes, we go beyond 
measurements of the average polarization 
transfer and analyze the full distribution 
function of the polarization transfer Pr(P; t) 
across snapshots (38). This distribution function 
can distinguish KPZ from potential alternatives 
Fig. 5. Distribution function of polarization transfer. (A) The probability distribution asymmetry of the
polarization transfer expected for KPZ transport is quantified by the skewness. We compare the pure
domain-wall dynamics in the 1D case (green) with the nonintegrable 2D case at $J_\perp/J$ = 0.25(1) (orange).
Whereas the 2D case becomes symmetric at late times, the 1D distribution remains asymmetric with
a skewness of 0.33(8). Gray lines indicate the skewness of the Gaussian-orthogonal-ensemble (GOE) and
Gaussian-unitary-ensemble (GUE) Tracy-Widom (TW) distributions (41, 54). Colored lines serve as guides
to the eye. (Insets) Probability distributions of the polarization transfer on a logarithmic scale. The vertical
line marks the mean of the distribution. (B) The mean (circles) of the polarization transfer is consistent
with the data shown in Fig. 3 and scales with the power-law (solid lines) exponent [1/z = 0.67(1) in
one dimension; 1/z = 0.60(2) in two dimensions]. The standard deviation (triangles) features another
characteristic transport exponent [the growth exponent (55)], which is consistent with the extracted power-law
(dashed lines) exponent [β = 0.31(1) in one dimension; β = 0.24(1) in two dimensions]. Error bars
denote the SD obtained from a bootstrap analysis.


such as Lévy flights: For all linear processes 
(such as Lévy flights or time-rescaled diffu- 
sion), the fluctuations of P at late times are 
necessarily symmetric about the mean; for 
KPZ, the limiting distribution Pr(P; t → ∞) is
the Tracy-Widom (TW) distribution (41), which 
is strongly asymmetric (10, 54). 

Measuring the statistics of the polarization- 
transfer distribution therefore gives us a direct 
experimental observable to discern the un- 
derlying hydrodynamical transport equations. 
This analysis fundamentally relies on the single- 
shot nature and single-spin sensitivity of our 
quantum gas microscope. As we measure the 
occupation of a single-spin species per snapshot, 
we approximate the polarization-transfer statis- 
tics by the statistics for the single-species atom- 
number transfer, nit dn P/2, where Mt is the 
number of |t) atoms on the side of the domain 
wall initialized with the opposite spin ||). We 
quantify the asymmetry of the distribution 
about its mean P by its skewness (u,(t) — 
11s(0))/(Ho(t) — w2(0))”” (42), where p,.(¢) = 
>> p(P — P) Pr(P; t) denotes the Ath central 
moment of the distribution. 
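
A skewness of this form, the change in the third central moment divided by the change in the second central moment raised to the power 3/2, can be estimated directly from per-snapshot transfer values. The sketch below assumes two lists of single-shot values, one at time t and one at time 0; the function names and example values are hypothetical, and the bootstrap error analysis mentioned in the figure caption is not reproduced.

```python
from typing import Sequence


def central_moment(samples: Sequence[float], k: int) -> float:
    """k-th central moment of a list of single-shot values."""
    mean = sum(samples) / len(samples)
    return sum((x - mean) ** k for x in samples) / len(samples)


def skewness(samples_t: Sequence[float],
             samples_0: Sequence[float]) -> float:
    """(mu3(t) - mu3(0)) / (mu2(t) - mu2(0))^(3/2) from per-shot transfers.

    Assumes the variance at time t exceeds the initial variance.
    """
    mu2 = central_moment(samples_t, 2) - central_moment(samples_0, 2)
    mu3 = central_moment(samples_t, 3) - central_moment(samples_0, 3)
    return mu3 / mu2 ** 1.5


# Hypothetical single-shot values, not experimental data.
print(skewness([0, 0, 0, 4], [0, 0, 0, 0]))
print(skewness([-1, 0, 1], [0, 0, 0]))
```

Subtracting the moments at time 0 removes the contribution of fluctuations already present in the initial state, so that only fluctuations generated by the dynamics enter the estimate.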

To begin, we characterize the skewness 
of the polarization transfer starting from 
a high-purity domain wall [η = 0.89(1), δ =
0] for a 2D geometry with an interchain
coupling strength $J_\perp/J$ = 0.25(1). The skewness
of the polarization-transfer distribution
is small overall and is most consistent
with a decay toward zero (Fig. 5) (42), which
corresponds to a fully symmetric distribution
and is expected for linear diffusive processes
exhibited by the nonintegrable 2D
Heisenberg model.

If the 1D Heisenberg model is actually 
governed by nonlinear KPZ hydrodynamics, 
one would expect markedly distinctive behav- 
ior for the skewness as a function of time. In 
particular, the nonlinearity of the KPZ equa- 
tion would lead to a finite skewness, which is 
constant over time. We indeed observe that 
the skewness saturates to a finite value of 
0.33(8) when starting from an initial state 
with η = 0.91(2) and δ = 0 (Fig. 5). In agree-
ment with numerical simulations, this value 
is consistent with the skewness of the Gaussian- 
orthogonal-ensemble (GOE) TW distribution, 
0.294 (54). Our experiment directly rules out 
linear transport processes and thus provides 
a strong indication that transport in the 1D 
quantum Heisenberg chain is governed by 
KPZ hydrodynamics. 


Discussion and conclusion 


Our results support the theoretical conjecture 
that spin transport in the 1D Heisenberg 
model belongs to the KPZ universality class, 
with a superdiffusive transport exponent z = 
3/2. We have experimentally demonstrated that 
both integrability and nonabelian symmetry are 
essential for stabilizing superdiffusive transport. 
Moreover, we exploit the single-spin sensitivity 
of our setup to extract the full distribution func- 
tion of the polarization transfer. This distribu- 
tion function exhibits a large skewness that 
does not decay in time, demonstrating that 
spin transport in this system belongs to a strongly coupled, nonlinear dynamical universality class.

Our work builds and expands on recent ex- 
perimental explorations of Heisenberg-model 
spin dynamics. These experiments include neu- 
tron scattering studies of the quantum material 
KCuF₃ (36), as well as experiments that probe
the relaxation of spin-spiral initial states in 
ultracold gases (33-35). In the 1D Heisenberg 
model, the relaxation of such spin-spiral states 
is nongeneric because they are approximate 
eigenstates in the long-wavelength limit (26). 
Empirically, spin spirals relax with a diffusive 
exponent z = 2. By considering a more generic
family of domain-wall initial states, we are able 
to directly probe (and controllably move away 
from) the high-temperature linear-response 
limit where KPZ transport is thought to occur. 

Our results open the door to a number of 
future directions. First, the discrepancy be- 
tween the relaxation of domain walls and 
spin spirals (away from linear response) indi- 
cates that relaxation in integrable systems is 
generally strongly dependent on initial state, 
although we currently lack a theory of this 
nonlinear regime. Second, the robustness 
of our results along the crossover from the 
Heisenberg to the (nonintegrable) Bose- 
Hubbard regime remains to be fully under- 
stood (57). In this context, a comparison 
between the nonintegrable Bose-Hubbard model 
and the integrable Fermi-Hubbard model (53) 
could be of particular interest. Finally, the ob- 
servable we introduced to capture fluctuation 
effects—namely, the statistics of single shots 
of the polarization transfer—promises to be a 
powerful diagnostic tool for new phases of 
interacting quantum systems. A theory of this 
quantity already exists for the KPZ universality 
class, but developing a more general theory of 
such transport fluctuations will be an impor- 
tant task for future theoretical work. 

Note added in proof: During the comple- 
tion of this manuscript, we became aware of 
related work to observe superdiffusive transport 
in a long-range interacting ion chain (37).

QUANTUM SIMULATION

Observing emergent hydrodynamics in a long-range quantum magnet


M. K. Joshi, F. Kranzl, A. Schuckert, I. Lovas, C. Maier, R. Blatt, M. Knap, C. F. Roos


Identifying universal properties of nonequilibrium quantum states is a major challenge in modern physics. 
A fascinating prediction is that classical hydrodynamics emerges universally in the evolution of any 
interacting quantum system. We experimentally probed the quantum dynamics of 51 individually controlled 
ions, realizing a long-range interacting spin chain. By measuring space-time-resolved correlation functions 
in an infinite temperature state, we observed a whole family of hydrodynamic universality classes, ranging 
from normal diffusion to anomalous superdiffusion, that are described by Lévy flights. We extracted the 
transport coefficients of the hydrodynamic theory, reflecting the microscopic properties of the system. 
Our observations demonstrate the potential for engineered quantum systems to provide key insights into 
universal properties of nonequilibrium states of quantum matter. 


In quantum many-body systems at equilibrium, the concept of universality asserts that microscopic details do not influence the nature of the emergent quantum phases of matter and their transitions. Rather, symmetries and topology determine the essential macroscopic properties. By contrast, all scales, from low to high energies, are a priori relevant for quantum systems that are driven far from their thermal equilibrium. Recent experimental progress in engineering coherent and interacting quantum systems has made it possible to create and explore exotic nonequilibrium states, which can exhibit unconventional relaxation dynamics (1-6), dynamical phases (7-12), and transitions between them (13, 14). Despite this wealth of observed quantum phenomena, a common anticipation is that classical hydrodynamics of a few conserved quantities emerges universally for any complex quantum system, as strong interactions entangle and effectively mix local degrees of freedom (15, 16). However, verifying this assumption, and furthermore determining the nonuniversal transport coefficients of the emergent hydrodynamic theory for specific systems, is challenging. Recently, enormous efforts have been devoted to detecting hydrodynamic transport in quantum gases (17-20) and condensed matter systems (21-24). Although transport is generally expected to be diffusive, a variety of largely unexplored classes of hydrodynamics have been predicted theoretically, including anomalous subdiffusive (25, 26) and superdiffusive transport (27-29).

In this work, we experimentally probed the 
dynamics of a long-range quantum magnet, 
realized in a quantum simulator of 51 indi- 
vidually controlled ions. We developed a pro- 
tocol to measure space- and time-resolved spin 
correlations in an engineered infinite tem- 
perature state, enabling us to experimentally 
establish that hydrodynamics emerges in the 
nonequilibrium quantum state. By tuning the 
long-range character of the interactions, we 
observed a whole family of hydrodynamical 
universality classes, ranging from normal dif- 
fusion to anomalous superdiffusion. 


Engineering a long-range quantum magnet 


Fig. 1. Emergent hydrodynamics in a long-range quantum magnet. (A) In our system, the dynamics of a locally created spin excitation is effectively governed by a hydrodynamic theory of Lévy flights, which are random walks with step sizes drawn from a distribution with algebraic tails. This occasionally leads to long-distance jumps of the excitation. (B) The family of hydrodynamic universality classes characterized by a dynamical exponent z_α and a scaling function can be tuned by the power-law exponent α of the spin-spin interactions. (C) We measured infinite-temperature correlations by averaging over initial product states and preparing the central ion deterministically in the same state (blue box) [(32), section 2]. Picture of three exemplary initial states in a chain of ⁴⁰Ca⁺ ions (|↑⟩, dark spots; |↓⟩, bright spots). White squares and circles indicate

The (pseudo-)spins of our quantum system are realized with two electronic states of the ⁴⁰Ca⁺ ion: |S_{1/2}, m = +1/2⟩ as |↓⟩ and |D_{5/2}, m = +5/2⟩ as |↑⟩. The quantum state of individual ions is
controlled by a tightly focused, steerable laser 
beam capable of addressing any ion in the string, 
in conjunction with a laser beam that collect- 
ively interacts with the ions. A two-tone laser 
field realizes approximately power-law decaying 
Ising interactions between the (pseudo-)spins 
by off-resonantly coupling motional and elec- 
tronic degrees of freedom of the ion chain. 
Application of a strong transverse field ener- 
getically penalizes spin nonconserving contri- 
butions (30). The effective dynamics are then 
described by the long-range XY model 


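$$\hat{H} = \sum_{i<j} \frac{J}{|i-j|^{\alpha}} \left( \hat{\sigma}_i^{+} \hat{\sigma}_j^{-} + \hat{\sigma}_i^{-} \hat{\sigma}_j^{+} \right) \qquad (1)$$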


where J and α determine the strength and the range of the spin-spin interaction, respectively (31). They can be tuned by varying the amplitude and frequency of the two-tone laser field. σ̂_i^+ and σ̂_i^- are the raising and lowering spin operators acting on the ith spin, respectively. Further experimental details are given in the supplementary materials [(32), sections 1 to 4].


Theoretical expectations for late-time dynamics 


the intended preparation of |↑⟩ and |↓⟩, respectively, achieved with 99% fidelity per ion. (D) Measured autocorrelations (green squares) for 51 ions and α = 1.1 (gray line is a guide to the eye). Error bars, denoting the standard error of the mean, are smaller than the symbols [(32), section 5]. At short times (red shading), the spin excitation quickly relaxes to a local equilibrium state. At late times (blue shading), global conservation laws constrain the relaxation of spin excitations, leading to a slow power-law decay of the autocorrelations. (Insets) Autocorrelations on a double logarithmic scale for different values of α, highlighting the tunable transport. Gray dashed lines are power laws with the predicted exponent from Lévy flights. Here, J = 248, 129, and 116 rad/s for α = 0.9 (51 ions), α = 1.1 (51 ions), and α = 1.5 (25 ions), respectively.

Conservation laws determine the macroscopic late-time dynamics of quantum systems. In the long-range XY model, the global U(1) symmetry implies the conservation of the total magnetization Σ_i σ̂_i^z. This constrains the relaxation of spin excitations. The long-range couplings enable transport of spin excitations over many lattice sites. As a consequence, at late times the effective dynamics are expected to be governed by classical Lévy flights, which describe random walks with long-distance jumps (Fig. 1A) (29, 33). The probability for a jump of distance ℓ in the Lévy flight is set by the power-law couplings of the long-range XY model as ~J²/ℓ^{2α} [(32), section 8]. These long-distance jumps become irrelevant when the variance of the jump probability is finite. In our model, this is the case for α > 3/2, implying conventional diffusion [(32), section 8]. However, the variance diverges for α < 3/2, giving rise to a breakdown of conventional diffusion. The hydrodynamics of our system is therefore expected to be described by the anomalous diffusion equation

$$\partial_t \langle \hat{\sigma}_j^z \rangle = D_\alpha \nabla^{z_\alpha} \langle \hat{\sigma}_j^z \rangle \qquad (2)$$

where D_α denotes the associated transport coefficient and z_α is the dynamical exponent [(32), section 8]. The dynamical exponent z_α relates space and time and hence characterizes transport. For α > 3/2, transport is diffusive, and space and time are related by z_α = 2. For 1/2 < α < 3/2, we predict z_α = 2α - 1 (29), leading to anomalous superdiffusion. For α < 1/2, the spins are so strongly connected that the lattice geometry becomes irrelevant, and a regime with z_α = 0 is entered. Therefore, transport is strongly modified by the long-range character of the interactions. The predicted dynamical phase diagram is shown in Fig. 1B.
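
The three regimes of the predicted phase diagram can be tabulated with a few lines of code. This is only a sketch of the piecewise prediction quoted above; the function name `dynamical_exponent` is an illustrative assumption, and the boundary values α = 1/2 and α = 3/2 are assigned by continuity because both neighboring expressions agree there.

```python
def dynamical_exponent(alpha):
    """Predicted dynamical exponent z_alpha for interaction exponent alpha.

    alpha >= 3/2      : conventional diffusion, z = 2
    1/2 < alpha < 3/2 : anomalous superdiffusion, z = 2*alpha - 1
    alpha <= 1/2      : lattice geometry irrelevant, z = 0
    """
    if alpha >= 1.5:
        return 2.0
    if alpha > 0.5:
        return 2.0 * alpha - 1.0
    return 0.0


# Interaction exponents realized in the experiment
for alpha in (0.9, 1.1, 1.5):
    print(alpha, dynamical_exponent(alpha))
```

For the three experimental settings this gives z_α close to 0.8, 1.2, and 2, that is, two superdiffusive cases and one diffusive case.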


Experimental signatures of 
emergent hydrodynamics 


Hydrodynamic transport is most directly probed by creating a spin excitation at time t = 0 in the center of the chain and tracking how it propagates in space and time, as measured by the unequal-time correlation function

$$C_j(t) = \langle \hat{\sigma}_j^z(t) \, \hat{\sigma}_0^z(0) \rangle_T \qquad (3)$$

Over the course of the quantum dynamics, the initial excitation has to scatter off a highly excited background to quickly reach the hydrodynamic regime. Therefore, we measured C_j(t) at infinite temperatures T with magnetization ~0.

Fig. 2. Spatial correlation profiles. (A and B) Spatially resolved correlations for (A) α = 1.5 for 25 ions and (B) α = 1.1 for 51 ions. The spatial correlations are measured at (left) short times, (middle) intermediate times, and (right) in the hydrodynamic late-time regime. STE, analytic short-time expansion; ED, exact diagonalization accessible only for the shorter chain of 25 ions. In the hydrodynamic


Measuring spin correlations at different times and preparing an infinite temperature state are both difficult in isolated, engineered quantum systems. We overcame these challenges by expressing the infinite temperature expectation value as an equally weighted trace over product states |ψ_n⟩ in the σ̂^z basis [(32), section 10]. When preparing the central spin in the same polarization, we have σ̂_0^z|ψ_n⟩ = +|ψ_n⟩ (Fig. 1C). The correlation function can then be directly evaluated as C_j(t) ~ ⟨ψ_1|σ̂_j^z(t)|ψ_1⟩ + ⟨ψ_2|σ̂_j^z(t)|ψ_2⟩ + .... It can be accurately obtained by sampling a finite number of initial product states [(32), section 10]. Yet for a comparatively small and experimentally accessible number of initial states, large statistical fluctuations are expected in the measured correlations. We removed these fluctuations by sampling pairs of conjugate product states, |ψ_n⟩ and |ψ̄_n⟩, where in the second configuration all spins are flipped except for the central one. For each pair of product states, initial correlations are unity in the center of the system and zero elsewhere, reproducing directly this property of the full trace (Fig. 1C, first two initial states). With this procedure, convergence is achieved even for a small number of initial product states [(32), section 10]. For α = 0.9 and 1.1 and for α = 1.5, respectively, we created 60 and 120 initial product state configurations, each of which was realized, evolved, and measured 50 to 200 times.
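
The benefit of conjugate pairs can be checked at t = 0 without any time evolution, because for a product state in the σ̂^z basis the expectation value of σ̂_j^z is simply the ±1 entry of the configuration. The sketch below compares paired and unpaired sampling under that simplification; the helper names `conjugate_pair` and `initial_correlations`, the seed, and the number of pairs are illustrative assumptions and do not reproduce the experimental sampling protocol.

```python
import numpy as np


def conjugate_pair(n_sites, rng):
    """Random +/-1 configuration with the central spin up, plus its conjugate
    in which every spin except the central one is flipped."""
    center = n_sites // 2
    state = rng.choice([-1, 1], size=n_sites)
    state[center] = 1
    partner = -state
    partner[center] = 1
    return state, partner


def initial_correlations(states):
    """Estimate C_j(0): with the central spin fixed to +1, each product state
    contributes its own entry s_j, so the estimate is the site-wise mean."""
    return np.mean(states, axis=0)


rng = np.random.default_rng(1)
n_sites, n_pairs = 51, 30
center = n_sites // 2

pairs = [conjugate_pair(n_sites, rng) for _ in range(n_pairs)]
paired = initial_correlations([s for pair in pairs for s in pair])

# Same number of states, but drawn independently (no conjugate partners)
unpaired = initial_correlations(
    [conjugate_pair(n_sites, rng)[0] for _ in range(2 * n_pairs)]
)

print("paired, off-center max:  ", np.abs(np.delete(paired, center)).max())
print("unpaired, off-center max:", np.abs(np.delete(unpaired, center)).max())
```

With pairs, the off-center correlations cancel exactly and the central value is exactly one, whereas independent sampling leaves residual fluctuations that shrink only slowly with the number of states.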

The autocorrelation function C_0(t) determines the residual excitation in the center of the chain. We measured C_0(t) for 51 ions (Fig. 1D). The relaxation dynamics occurred in multiple stages. A local equilibrium was reached already after a few collisions, with the abundant excitations of the infinite temperature state. As a consequence, at short times the autocorrelation exhibited a rapidly damped oscillation. At later times, the system entered the hydrodynamic regime and eventually approached a global equilibrium through the slow rearrangement of spin excitations constrained by the conserved magnetization. The longest time to which we could probe transport was limited by the saturation of the correlation function set by the inverse system size. The power-law decay of the correlations became manifest on a double logarithmic scale (Fig. 1D, insets). Our experimental data are consistent with the hydrodynamic theory of Eq. 2, which predicts a power-law exponent of -1/z_α (Fig. 1D, insets, dashed gray lines). Hence, exploiting the experimental control over the long-range exponent α enabled us to tune the transport from normal diffusion to anomalous superdiffusion.

regime (right), the measured profiles are compared with predictions from Lévy flights. The spatial profile in (A) is compatible with a Gaussian (dashed) and in (B) with a Lorentzian (solid), as supported by the reduced χ² values of the fit: (A) χ²_L = 3.9, χ²_G = 1.3; (B) χ²_L = 1.4, χ²_G = 4.8 (obtained by fitting over the central 27 sites). Here, χ²_L and χ²_G are χ² values for the Lorentzian and Gaussian fits, respectively.

We obtained a more stringent test of the emergent hydrodynamics from the full spatial correlation profile (Fig. 2). We realized the long-range spin model for α = 1.5 with 25 ions (Fig. 2A) and for α = 1.1 with 51 ions (Fig. 2B). At early times (3.1 ms), the quantum dynamics are well-described by an analytic short-time expansion of the equations of motion [(32), section 9]. At intermediate times (14.1 ms), the excitation starts spreading through the system, but some quantum coherence remains, indicated by the spatial oscillations. For 25 ions, the measured dynamics compares well with results obtained from exact diagonalization, demonstrating the coherence of our quantum system [(32), section 7]. For the 51 ion chain, a comparison to exact diagonalization was not possible because of the exponential growth of the Hilbert space, which reached a dimension of 2^51 ≈ 10^15.

The quantum dynamics effectively coarsens 
the system during the time evolution as spa- 
tial oscillations with longer wavelengths 
start to dominate. At the latest times shown 
(70 ms), interactions have averaged out quan- 
tum interference patterns, and the hydro- 
dynamic regime is entered. This became even 
more apparent in the space-time correlations 
(Fig. 3A). By contrast, a single excitation on 
top of a spin-polarized state cannot scatter, 
and the associated correlations exhibit coher- 
ent space-time oscillation patterns instead 
(Fig. 3B) (30). 

The theory of Lévy flights predicts the following scaling form for the spatiotemporal profiles

$$C_j(t) = (D_\alpha t)^{-1/z_\alpha} \, F_\alpha\!\left( \frac{|j|}{(D_\alpha t)^{1/z_\alpha}} \right) \qquad (4)$$

where F_α is given by the family of stable symmetric distributions (33). The scaling between space and time entering the distribution F_α can be deduced directly from the generalized hydrodynamic equation Eq. 2. The distribution F_α cannot be expressed in terms of elementary functions, except for α = 3/2, where a Gaussian indicates conventional diffusion, and α = 1, where a Lorentzian appears.
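
The two closed-form limits can be verified numerically. The sketch below evaluates a stable symmetric distribution as the Fourier transform of exp(-|k|^z), which is what a k-space solution of Eq. 2 yields once length is measured in units of (D_α t)^{1/z_α}. The function name `stable_symmetric_pdf`, the unit-scale normalization, and the integration cutoff and grid are assumptions made for illustration only.

```python
import numpy as np


def stable_symmetric_pdf(x, z, k_max=60.0, n_k=600_001):
    """F(x) = (1/pi) * integral_0^inf cos(k*x) * exp(-k**z) dk,
    evaluated with a plain trapezoidal rule on a uniform k grid."""
    k = np.linspace(0.0, k_max, n_k)
    dk = k[1] - k[0]
    x = np.atleast_1d(np.asarray(x, dtype=float))
    out = np.empty_like(x)
    for i, xi in enumerate(x):
        f = np.cos(k * xi) * np.exp(-k**z)
        out[i] = dk * (f.sum() - 0.5 * (f[0] + f[-1])) / np.pi
    return out


x = np.linspace(-5.0, 5.0, 11)
gaussian = np.exp(-x**2 / 4.0) / np.sqrt(4.0 * np.pi)  # z = 2, i.e. alpha = 3/2
lorentzian = 1.0 / (np.pi * (1.0 + x**2))              # z = 1, i.e. alpha = 1

print(np.max(np.abs(stable_symmetric_pdf(x, 2.0) - gaussian)))
print(np.max(np.abs(stable_symmetric_pdf(x, 1.0) - lorentzian)))
```

For intermediate exponents such as z = 1.2 (α = 1.1) the same routine returns a profile between the two limits, which has no elementary closed form.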

The measured hydrodynamic profiles are compatible with a Lorentzian for α = 1.1 (F_{α=1.1} is still very close to a Lorentzian) and with a Gaussian for α = 1.5 (χ² analysis is provided in Fig. 2, right), which is in agreement with Eq. 4.

Fig. 3. Contrasting the infinite temperature background with the spin polarized background. (A) The deterministically prepared excitation of the central ion strongly interacts with all the other excitations of the infinite temperature background and slowly spreads through the system following the laws of classical hydrodynamics. (B) By contrast, a single excitation on top of the fully down-polarized state |↓…↓↑↓…↓⟩ has no other excitations to scatter off and therefore spreads freely, exhibiting quantum interference patterns. Data are measured for 25 ions with power-law exponent α = 1.5.

Fig. 4. Extracting transport coefficients. At late times, the correlations exhibit a self-similar scaling, relating space and time with a microscopic transport coefficient D_α. Shown are rescaled correlations τ C_j(t) as a function of rescaled space j/τ, where τ = (D_α t)^{1/z_α}. (A) α = 0.9, 51 ions. (B) α = 1.1, 51 ions. (C) α = 1.5, 25 ions. Correlations are shown for times Jt > 5; darker colors correspond to later times.


Determining scaling exponents and 
coefficients 


Close to equilibrium phase transitions, correlation functions show self-similarity as a function of the distance to the critical point and can be characterized by a set of universal scaling exponents as well as scaling functions (34). Hydrodynamics predicts a self-similar scaling of correlations as a function of time, characterized by the dynamical scaling exponent z_α and the scaling function F_α (35). Whereas z_α and F_α solely depend on the asymptotic decay of the interactions and can be predicted from purely hydrodynamic reasoning, the transport coefficient D_α is not universal and depends on the full quantum many-body spectrum. It is therefore challenging to make predictions of D_α by using analytical or numerical methods. To verify Eq. 4, we rescaled C_j(t) and space j as shown in Fig. 4, leading to a scaling collapse. By fitting the stable symmetric distributions to the collapsed data, we obtained a transport coefficient of D_α/J ≈ 0.57, 0.81, and 2.6 for α = 0.9, 1.1, and 1.5, respectively [(32), section 6]. From exact diagonalization, we obtained D_α/J ≈ 1.9 for α = 1.5.
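
The rescaling behind such a collapse is simple to state in code. The sketch below applies it to synthetic profiles that obey the scaling form exactly with a Lorentzian scaling function (z = 1); the helper name `rescale_profile`, the value of the transport coefficient, and the chosen times are hypothetical and serve only to illustrate the procedure, not to reproduce the measured data or the fit.

```python
import numpy as np


def rescale_profile(sites, corr, t, d_alpha, z_alpha):
    """Return (j / tau, tau * C_j(t)) with tau = (D_alpha * t)**(1 / z_alpha)."""
    tau = (d_alpha * t) ** (1.0 / z_alpha)
    return sites / tau, tau * corr


def lorentzian(x):
    """Stable symmetric scaling function for z = 1."""
    return 1.0 / (np.pi * (1.0 + x**2))


sites = np.arange(-25, 26, dtype=float)
d_alpha, z_alpha = 0.8, 1.0  # hypothetical values, chosen for illustration

collapsed = []
for t in (10.0, 20.0, 40.0):
    tau = (d_alpha * t) ** (1.0 / z_alpha)
    corr = lorentzian(sites / tau) / tau  # synthetic profile obeying the scaling form
    collapsed.append(rescale_profile(sites, corr, t, d_alpha, z_alpha))

# After rescaling, every time slice lies on the same curve
for x_scaled, y_scaled in collapsed:
    print(np.max(np.abs(y_scaled - lorentzian(x_scaled))))
```

In an actual analysis D_α would be treated as a fit parameter: only the correct combination of D_α and z_α brings profiles from different times onto a single curve.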


Discussion 


Large, coherent quantum systems with local 
control provide key insights into fundamental 
properties of nonequilibrium quantum states. 
In our experiment, we measured the emergent 
macroscopic hydrodynamics in an infinite tem- 
perature state of a strongly interacting spin 
chain—a regime that is notoriously challenging 
to describe by controlled analytical or numer- 
ical calculations. By measuring the full spatio- 
temporal profile of the hydrodynamic scaling 
functions, we experimentally established a 
tunable family of transport that ranges from con- 
ventional diffusion to anomalous superdiffusion. 
One prospect is to investigate the generality 
of the dynamical phase diagram (Fig. 1B) for 
other models with long-range interactions that 
conserve the total magnetization or charge. 
An example could be the long-range Heisenberg 
model, which can be realized in our system with 
suitably designed Floquet protocols (36). More- 
over, our tools for measuring high-temperature 
correlations can be readily applied to other 
quantum devices with local control, including 
quantum gas microscopes, Rydberg atom arrays, and superconducting qubits. Diffusion
constants measured in controlled synthetic 
quantum matter could serve as a benchmark 
for newly developed numerical methods and 
for comparing different experimental plat- 
forms that realize the same model. Creating 
finite temperature states in the vicinity of a 
thermal phase transition with synthetic quan- 
tum matter would enable the study of emer- 
gent dynamic critical phenomena, describing 
a modified hydrodynamics in which additional 
conservation laws are introduced by the order 
parameter (37). Emergent hydrodynamics could 
also be tested for different initial states, such as 
certain product states (5). 

Note added in proof: During the comple- 
tion of this manuscript, we became aware of 
related work demonstrating superdiffusive 
transport in an integrable Heisenberg chain 
with nearest-neighbor superexchange interactions (38).

Paradoxical somatodendritic decoupling supports cortical plasticity during REM sleep

Mattia Aime, Niccolò Calcini, Micaela Borsa, Tiago Campelo, Thomas Rusterholz, Andrea Sattin, Tommaso Fellin, Antoine Adamantidis


Rapid eye movement (REM) sleep is associated with the consolidation of emotional memories. Yet, the 
underlying neocortical circuits and synaptic mechanisms remain unclear. We found that REM sleep is 
associated with a somatodendritic decoupling in pyramidal neurons of the prefrontal cortex. This 
decoupling reflects a shift of inhibitory balance between parvalbumin neuron—mediated somatic 
inhibition and vasoactive intestinal peptide—-mediated dendritic disinhibition, mostly driven by neurons 
from the central medial thalamus. REM-specific optogenetic suppression of dendritic activity led to a 
loss of danger-versus-safety discrimination during associative learning and a lack of synaptic plasticity, 
whereas optogenetic release of somatic inhibition resulted in enhanced discrimination and synaptic 
potentiation. Somatodendritic decoupling during REM sleep promotes opposite synaptic plasticity 
mechanisms that optimize emotional responses to future behavioral stressors. 


Rapid eye movement [(REM) or paradoxical] sleep is characterized by a specific activation of neuronal populations that encode emotion and reward in animals (1-3) and humans (4-6). This REM sleep-specific activity is associated with molecular and cellular fingerprints of enhanced synaptic plasticity, including up-regulated gene expression, long-term potentiation (LTP) (7-9), and heightened dendritic activity (10, 11) in the cortex and hippocampus that ultimately consolidates memory traces (12, 13). This is in contrast to the quiescence of brain structures, including the prefrontal cortex (PFC), that is associated with the pruning of dendritic spines (1, 14), synaptic weakening (15), and forgetting during REM sleep (16-18). These apparent controversies are the result of the heterogeneity of (sub)cellular activity in limbic (19, 20) or cortical structures (4, 11, 21-23), limited cellular resolution of human brain imaging technologies (4-6), and the lack of selectivity of REM sleep-deprivation procedures (8, 24). In this work, we investigated the mechanisms encoding emotional memories during REM sleep within dorsal PFC (dPFC) microcircuits, a critical brain structure for executive functions (25) and discrimination of danger from safety (26).


Decoupling of somatodendritic activities in 
cortical neurons during REM sleep 


We first dissected dPFC microcircuit dynamics during REM sleep in head-restrained sleeping mice using simultaneous two-photon calcium imaging and electrophysiological recordings from chronic electroencephalogram (EEG) and electromyogram (EMG) electrodes (Fig. 1A). An adeno-associated virus encoding the calcium indicator GCaMP6 under the calmodulin kinase II promoter (AAV1-CaMKII-GCaMP6) was stereotactically injected into the dPFC area. The resulting GCaMP6 expression was specific to pyramidal (PYR) neurons of superficial layers (L2/3) (Fig. 1, A and B). Simultaneous recordings revealed a significant decrease of PYR somatic activity during REM sleep as compared with non-REM (NREM) sleep or wakefulness (Fig. 1, C and D; movie S1; and fig. S1; see materials and methods for data analysis and motion artifact detection). This decrease was consistent across all animals tested (fig. S2) and was confirmed using deconvolution analytical methods (materials and methods and fig. S3, A to D). Although few PYR somas remained active during REM sleep (8%), a substantial set of PYR cells was preferentially active during either NREM sleep (31%) or wakefulness (61%) (Fig. 1E). Distribution of single-PYR cell ΔF/F₀ values (where ΔF/F₀ is the ratio of the change in fluorescence to the baseline fluorescence) indicated low activity during REM sleep as compared with NREM sleep and wakefulness (Fig. 1F). REM sleep was associated with a significant decrease in the frequency of calcium (Ca²⁺) events compared with those in other states (~18%; versus ~33% in NREM sleep or ~49% in wakefulness) (Fig. 1, G and H), although the integrals and amplitudes of Ca²⁺ events were similar across states (fig. S4).
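
As a concrete reading of the ΔF/F₀ definition given above, the following sketch converts a raw fluorescence trace into ΔF/F₀. The baseline estimate used here (a low percentile of the trace) is an assumption chosen for illustration; the actual baseline procedure is described in the materials and methods, which are not reproduced in this text, and the function name `delta_f_over_f0` is hypothetical.

```python
import numpy as np


def delta_f_over_f0(trace, baseline_percentile=10.0):
    """Return (F - F0) / F0 for a raw fluorescence trace.

    F0 is estimated as a low percentile of the trace; this choice is an
    illustrative assumption, not the published analysis pipeline.
    """
    trace = np.asarray(trace, dtype=float)
    f0 = np.percentile(trace, baseline_percentile)
    return (trace - f0) / f0


# Toy trace: flat baseline of 100 with a single transient reaching 150
toy_trace = [100.0] * 9 + [150.0]
print(delta_f_over_f0(toy_trace))
```

In this toy example the baseline samples map to 0 and the transient maps to 0.5, that is, a 50% increase over baseline.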

We next tested whether apical dendrites of L2/3 dPFC PYR neurons were similarly modulated during REM sleep in mice cotransduced with an AAV1-Syn-flex-GCaMP6 and an AAV1-CaMKII-Cre to obtain a sparse labeling of dendrites (materials and methods; Fig. 1, I and J; and movie S1). Spatial propagation of GCaMP6s activity along individual dendrites was quantified for all GCaMP6s-expressing dendrites. Each identified branch was segmented into 0.5-µm-wide rectangular regions of interest (ROIs), and ΔF/F₀ traces of each ROI were extracted (materials and methods and Fig. 1, K and L). In contrast to PYR somas, PYR dendrites were highly active during REM sleep (Fig. 1M), as shown by the significant increase of the frequency of events spreading along the dendrites (>20 µm) compared with events confined to small portions of the dendritic branches (<20 µm). The average propagation of events along the dendrites was also higher during REM sleep (Fig. 1, N and O), providing evidence for a REM sleep-specific disinhibition of apical dendrites concomitant with a tonic inhibition of PYR somas.


Shift in excitation/inhibition balance during 
REM sleep in cortical microcircuits 


Fig. 1. Somatodendritic decoupling during REM sleep in dPFC PYR neurons. (A) Schematic of the experimental procedure. (B) (Left) Expression profile of L2/3 GCaMP6 PYR neurons. Scale bar, 1 mm. (Right) GCaMP6 PYR neurons. Scale bar, 20 µm. (C) EEG [red squares are magnifications of NREM sleep (gray), REM sleep (fuchsia), and wake (light blue) traces], EMG, hypnogram, color-coded power spectrum density, and color-coded and raw ΔF/F₀ activity profiles of a population of PYR somas. Freq., frequency. (D) Mean ± SEM (black line) ΔF/F₀ of PYR somas (n = 238 neurons; m = 6 mice) during wake, NREM, and REM sleep. ***P < 0.001; one-way repeated measures analysis of variance (RM ANOVA) with Tukey post hoc test. (E) Percentages of PYR somas with maximal values of ΔF/F₀ in wake […] of ΔF/F₀ values during wake, NREM, and REM sleep. (G) Frequency of somatic Ca²⁺ events during wake, NREM, and REM sleep. *P < 0.05; one-way RM ANOVA with Tukey post hoc test. Black lines represent mean values. (H) Distribution of somatic Ca²⁺ events in wake, NREM, and REM sleep. (I) Schematic of the experimental strategy. (J) Expression profile of GCaMP6 in the superficial layers of the dPFC. Scale bar, 1 mm. (K) (Left) GCaMP6 PYR dendrites. Scale bar, […] propagation of all dendrites during wake, NREM, and REM sleep. ***P < 0.001; one-way RM ANOVA with Tukey post hoc test.

To investigate the excitation/inhibition (E/I) balance during REM sleep, we recorded the activity of parvalbumin (PV), vasoactive intestinal peptide (VIP), and somatostatin (SST) dPFC interneurons from PV-IRES-Cre, VIP-IRES-Cre, or SST-IRES-Cre mice, respectively, transduced with AAV1-Syn-flex-GCaMP6 in sleeping mice (materials and methods, Fig. 2A, and movie S1). PV interneurons are strategically positioned to exert feed-forward inhibition by innervating principally the PYR somas (27). Accordingly, the activity of GCaMP6-expressing PV interneurons was significantly increased during REM sleep (Fig. 2, B and C), an observation that is consistent with the elevated ΔF/F₀ values of GCaMP6-expressing PV neurons during REM sleep (~72% of recorded neurons) (Fig. 2, D, E, and F).

VIP interneurons have been described to 
preferentially inhibit SST y-aminobutyric 
acid (GABA)-releasing cells, which in turn 
synapse primarily onto the dendritic shaft 
of PYR neurons (28). GCaMP6-expressing VIP 
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(A) Expression profile of GCaMP6 interneurons in the dPFC. Scale bars, 1 mm 
100 um (bottom). (B) (Left) Schematic of the experimental procedure. 
(Right) GCaMP6 PV neurons. Scale bar, 30 wm. (C) Hypnogram and color-coded 
and raw AF/Fo traces of PV neurons. Red arrowheads indicate motion artifacts. 
(D) Mean + SEM (red line) AF/Fo of PV neurons (n = 137 neurons; m = 7 mice) 
during wake, NREM, and REM sleep. ***P < 0.001; one-way RM ANOVA with Tukey 
test. (E) Percentages of PV neurons with maximal values of AF/Fo in 
wake (light blue), NREM (gray), and REM (fuchsia) sleep. (F) Line (top) and scatter 
plots showing the distribution of PV neurons AF/Fo values during wake, 
NREM, and REM sleep. (G) (Left) Schematic of the experimental procedure. 
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and raw AF/Fo traces of VIP neurons. Red arrowheads indicate motion artifacts. 
(I) Mean + SEM (green line) AF/Fo of VIP neurons (n = 230 neurons; m = 5 mice) 
during wake, NREM, and REM sleep. **P < 0.01; ***P < 0.001; one-way RM 
ANOVA with Tukey post hoc test. (J) Percentages of PV neurons with maximal 
values of AF/Fo in wake (light blue), NREM (gray), and REM (fuchsia) sleep. 

(K) Line (top) and scatter (bottom) plots showing the distribution of VIP neurons
ΔF/F₀ values during wake, NREM, and REM sleep. (L) (Left) Schematic of the
experimental procedure. (Right) GCaMP6 SST neurons. Scale bar, 30 µm.
(M) Hypnogram and color-coded and raw ΔF/F₀ traces of SST neurons. Red
arrowheads indicate motion artifacts. (N) Mean ± SEM (dark blue line) ΔF/F₀
of SST neurons (n = 64 neurons; m = 3 mice) during wake, NREM, and REM sleep.
***P < 0.001; one-way RM ANOVA with Tukey post hoc test. (O) Percentages
of SST neurons with maximal values of ΔF/F₀ in wake (light blue), NREM (gray),
and REM (fuchsia) sleep. (P) Line (top) and scatter (bottom) plots showing
the distribution of SST neurons ΔF/F₀ values during wake, NREM, and REM
sleep. (Q) (Left) Mean ± SEM activity profiles of dPFC neuronal populations
during NREM-to-REM sleep transitions (black = PYR, red = PV, green = VIP, dark
blue = SST). (Right) ΔF/F₀ change ± SEM across transitions. ***P < 0.001; one-


interneurons showed a significant enhancement
of ΔF/F₀ values during REM sleep, with
a large portion of VIP neurons (~56%) display- 
ing maximal activity during REM sleep (Fig. 2, 
G to K). By contrast, GCaMP6-expressing SST 
interneurons were mainly silent during REM 
sleep, except for a small subpopulation (~10%) 
(Fig. 2, L to P). These results were consistent 
across animals (fig. S5) and were confirmed 
with deconvolution analytical methods (fig. S3, 
E to H, for PV; fig. S3, I to L, for VIP; and fig. 
S3, M to P, for SST). 
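
The per-state percentages reported above (for example, the share of VIP neurons with maximal activity during REM sleep) can be illustrated with a minimal sketch. It assumes that each neuron is summarized by its mean ΔF/F₀ in wake, NREM, and REM sleep and is assigned to the state with the largest mean. The function names, the baseline definition, and the toy values are hypothetical and are not taken from the study's analysis code.

```python
from collections import Counter

STATES = ("wake", "NREM", "REM")


def delta_f_over_f0(trace, baseline):
    """Convert raw fluorescence samples F into (F - F0) / F0.

    How F0 is estimated (here simply passed in) is an assumption.
    """
    return [(f - baseline) / baseline for f in trace]


def preferred_state(mean_dff_by_state):
    """Return the vigilance state in which one neuron's mean dF/F0 is largest."""
    return max(STATES, key=lambda state: mean_dff_by_state[state])


def state_percentages(neurons):
    """Percentage of neurons whose mean dF/F0 peaks in each state."""
    counts = Counter(preferred_state(n) for n in neurons)
    total = len(neurons)
    return {state: 100.0 * counts[state] / total for state in STATES}


# Toy values for four hypothetical neurons (not data from the study).
neurons = [
    {"wake": 0.10, "NREM": 0.05, "REM": 0.40},
    {"wake": 0.30, "NREM": 0.10, "REM": 0.20},
    {"wake": 0.05, "NREM": 0.02, "REM": 0.25},
    {"wake": 0.08, "NREM": 0.12, "REM": 0.03},
]
percentages = state_percentages(neurons)
print(percentages)
```

With real recordings, each dictionary would hold the state-averaged ΔF/F₀ of one segmented soma, and the same tally would be repeated per cell type.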

The activity of GCaMP6-expressing VIP 
and PV neurons progressively increased 
over NREM-to-REM sleep transitions, whereas 
GCaMP6-expressing PYR somas and SST inter- 
neurons both showed opposite responses (Fig. 
2Q). Notably, the ΔF/F₀ values of PYR somas
during NREM sleep episodes were predictive
of the next sleep-wake transition because ΔF/F₀
values were lower in NREM sleep preceding
REM sleep than ΔF/F₀ values preceding
awakening (fig. S6). Upon awakening, the ΔF/F₀
values of PYR and SST somas significantly
increased, whereas those from VIP and PV somas
progressively decreased (Fig. 2R). The activity
of each population of neurons correlated with 
tonic (6 to 9 Hz) (Fig. 2S), but not phasic (9 to 
13 Hz), REM sleep. 


Thalamic control of PV neuron-mediated 
inhibition of PYR cell somas during REM sleep 


We next sought to identify the dPFC synaptic 
inputs responsible for this REM sleep-specific 
shift in E/I balance within dPFC microcircuits 
using rabies virus-based anatomical mapping
and optogenetic circuit probing. To map
the synaptic inputs to dPFC PV and VIP neurons,
we injected AAV2/1-CMV-Flex-TVA-mCherry-2A-oG
and RV-CMV-ENVA-EGFP
(EGFP, enhanced green fluorescent protein)
(29) in PV- and VIP-IRES-Cre mice, respectively
(materials and methods). dPFC PV
neurons were monosynaptically innervated
by neurons located in the central medial
thalamus (CM), the agranular retrosplenial
cortex (RSA) and granular retrosplenial cortex
(RSG), and the primary somatosensory
barrel cortex (S1BF) (Fig. 3, A to E). Input cells
to VIP neurons were scattered throughout the 
brain with a few neurons retrogradely labeled 
in the CM, whereas dPFC PYR neurons were 
monosynaptically contacted by neurons re- 
stricted to the S1 cortical area and the ven- 
trobasalis (VB) nucleus of the thalamus but 
not the CM (fig. S7). We then tested for monosynaptic
transmission in the CM(CaMKII)-dPFC(PV)
circuit using in vitro patch-clamp recordings.
Genetic targeting of a red fluorescent reporter 
(mCherry) to PV interneurons was achieved 
by stereotactic injection of an AAV2-EF1-flex-
mCherry in the dPFC of PV-IRES-Cre mice, 
whereas the expression of channelrhodopsin-2 
(ChR2) was targeted to excitatory CM neurons 
by stereotactic infusion of an AAV2-CaMKII- 
ChR2-eYFP (eYFP, enhanced yellow fluorescent 
protein) into the CM area (materials and 
methods and Fig. 3F). Optogenetic activation 
of ChR2-expressing CM neuron axon termi- 
nals in the dPFC elicited reliable excitatory 
postsynaptic currents (EPSCs) in ~41% of all 
recorded PV neurons with short latencies (Fig. 3, 
G to I, and fig. S8A). 

We next confirmed that the CM-dPFC path- 
way was selectively active during REM sleep 
using two-photon imaging in sleeping mice. 
To achieve this, AAV1-CaMKII-GCaMP6 virus 
was stereotactically injected in the CM of C57Bl6
mice and a cranial window was implanted 
above the dPFC to allow for two-photon im- 
aging of GCaMP6-expressing CM axons in the 
dPFC (Fig. 3, J to L, and fig. S8B). The activity 
of CM axons in the PFC showed a significant 
enhancement during REM sleep (Fig. 3, M 
and N), with ~77% of all recorded projections 
displaying maximal ΔF/F₀ values during REM
sleep (Fig. 3O). This finding was consistent
with the high activity profiles and high state- 
selectivity of a subset of the CM cell somas 
(17%) during REM sleep, as shown by two- 
photon imaging of CM neurons through a 
microendoscope lens (materials and methods 
and figs. S8C and S9). This heterogeneity sug- 
gested that only a subset of REM sleep-active 
CM neurons densely projects to the dPFC. 

To assess for a causal role for CM neurons 
in modulating the PV-PYR dPFC circuit during 
REM sleep, we optogenetically silenced the 
activity of CM neurons while conducting dual- 
color two-photon imaging of dPFC PYR and 
PV cell somas during REM sleep (Fig. 3, P and 
Q). PV-IRES-Cre animals were stereotactically 
injected with AAV1-CaMKII-SwichRCA in 
the CM and AAV-CaMKII-GCaMP6 (GCaMP6- 
expressing PYR somas) and AAV-Syn-Flex- 
NES-JRCaMP1b (RCaMP-expressing PV 
neurons) in the dPFC and were chronically 
implanted with EEG and EMG electrodes, an 
optical fiber placed above the CM area (with a 
45° angle because of space constraints), and a 


way ANOVA with Tukey post hoc test. (R) (Left) Mean ± SEM activity profiles of
dPFC neuronal populations during the transition from REM sleep to wake. (Right)
ΔF/F₀ change ± SEM across transitions. ***P < 0.001; one-way ANOVA with
Tukey post hoc test. (S) Correlation analysis between peak REM sleep theta
frequency ± SEM and the respective ΔF/F₀ mean ± SEM of all recorded neurons


cranial window above the dPFC (materials 
and methods, fig. S8D, and Fig. 3P). Optogenetic 
silencing of SwichRCA-expressing CM neurons 
induced a reliable and significant increase of 
PYR soma activity (Fig. 3, R to T), concomitant 
to a significant reduction of PV neuron activities 
during REM sleep, as compared with the control 
groups (Fig. 3, R, U, and V). Notably, the activity 
of both PYR and PV neurons, upon optogenetic 
silencing of CM neurons during REM sleep, was 
similar to the levels observed during wakeful- 
ness (Fig. 3, T and V). 


VIP-mediated disinhibition of PYR dendrites 


To test whether a causal link exists between 
REM sleep-specific activity enhancement of 
dPFC VIP neurons and PYR dendrites, VIP- 
IRES-Cre animals were stereotactically injected 
with AAV1-EF1-DIO-SwichRCA and AAV- 
CaMKII-GCaMP6 (diluted to obtain sparse 
labeling; see materials and methods) in the 
dPFC and were chronically implanted with 
EEG and EMG electrodes and a cranial window 
(Fig. 4, A and B). Simultaneous two-photon 
dendritic imaging and REM sleep-selective 
optical silencing of local VIP neurons was 
performed in superficial layers of the dPFC 
(Fig. 4C and materials and methods). Opto- 
genetic silencing of SwichRCA-expressing 
VIP neurons induced a significant reduction 
of the frequency of both confined and spread- 
ing calcium events (Fig. 4, D and E, respec- 
tively), revealing a causal role for dPFC VIP 
neurons in the REM sleep-specific disinhibi- 
tion of PYR dendrites. 


Somatodendritic decoupling supports REM 
sleep-specific memory consolidation 


Next, we assessed whether REM sleep soma- 
todendritic decoupling is required for synaptic 
plasticity and the discrimination of dangerous 
versus safe environments using REM sleep- 
specific optogenetic silencing of PV and VIP 
neurons during an auditory fear-conditioning 
learning task (11, 30, 31). To optogenetically 
silence PV (i.e., somatic disinhibition) or VIP 
(i.e., dendritic inhibition) neurons, we genetically
targeted the expression of archaerhodopsin
(ArchT) to PV and VIP neurons by stereotactic
injection of AAV2-EF1α-ArchT-eYFP or YFP tag
(AAV2-EF1α-eYFP) as controls in the dPFC from
PV-IRES-Cre and VIP-IRES-Cre mice, respec- 
tively (Fig. 5A). SST neurons were not included 
in this experiment because they were found to 
be inactive during REM sleep (Fig. 2, L to P). 
Fig. 3. Thalamic control of REM sleep-specific PV-mediated inhibition
of PYR somas. (A) Schematic of the experimental procedure. (B) Viral
injection site of flex-mCherry and RV-GFP in the dPFC. Scale bars,
500 µm (leftmost panel) and 50 µm (right three panels). (C) Expression
profiles of the major brain regions projecting to dPFC PV neurons (CM, RSA
and RSG, and S1BF). Scale bars, 1 mm. (D) Total distribution of identified
neurons ± SEM (n = 3 mice) in the CM, RS, and S1. (E) Density of identified
neurons ± SEM in the CM, RS, and S1. (F) Schematic of the experimental
strategy. (G) Example trace of light-evoked EPSC in a PV neuron. Black
traces indicate single evoked sweeps, the red trace indicates the
sweeps' mean, and the blue vertical line indicates optical stimulation.
(H) (Left) EPSC mean ± SEM latency from light stimulation (n = 7 neurons;
m = 3 mice). (Right) Mean ± SEM amplitude of evoked EPSCs. (I) Percentages
of cells responsive to light-evoked stimulation. (J) Schematic of the
experimental procedure. (K) (Left) Expression profile of GCaMP6 in the
CM. Scale bar, 1 mm. (Right) GCaMP6 CM axons. Scale bar, 30 µm.
(Bottom) Time-lapse activity profile of a CM. Scale bar, 0 µm.
(L) Hypnogram and color-coded ΔF/F₀ activity profiles of CM axons.
(M) Mean ΔF/F₀ ± SEM (black line) of CM axons (n = 112 axons; m = 5 mice)
during wake, NREM, and REM sleep. ***P < 0.001; one-way RM ANOVA with
Tukey post hoc test. (N) Line (top) and scatter (bottom) plots showing the
distribution of CM axons ΔF/F₀ values during wake, NREM, and REM sleep.
(O) Percentages of CM axons with maximal values of ΔF/F₀ in wake (light
blue), NREM (gray), and REM (fuchsia) sleep. (P) Schematic of the
experimental procedure. (Q) GCaMP6 PYR neurons and RCaMP PV neurons.
Scale bar, 30 µm. (R) Hypnogram and color-coded ΔF/F₀ activity profiles of
dPFC PYR and PV somas. Blue and red vertical lines indicate 473-nm and
638-nm optical stimulations, respectively. (S) REM sleep ΔF/F₀ ± SEM of a
group of PYR somas (n = 475 neurons; m = 8 mice) upon REM sleep-specific
CM silencing (opto) versus a control group (no opto) (n = 238 neurons;
m = 6 mice). ***P < 0.001; Student's t test. (T) PYR ΔF/F₀ change ± SEM
(difference between REM sleep and wake) in opto versus no opto.
***P < 0.001; Student's t test. (U) REM sleep ΔF/F₀ ± SEM of PV neurons
(n = 92 neurons; m = 4 mice) upon REM sleep-specific CM silencing (opto)
versus control (no opto) (n = 137 neurons; m = 7 mice). ***P < 0.001;
Student's t test. (V) PV ΔF/F₀ change ± SEM (difference between REM sleep
and wake) in opto versus no opto. ***P < 0.001; Student's t test.
Fig. 4. VIP-mediated disinhibition of PYR dendrites. (A) Schematic of the
experimental procedure. (B) GCaMP6 PYR dendrites (right) and SwichRCA VIP
neurons (left). Scale bar, 30 µm. (C) Hypnogram and ΔF/F₀ traces of ROIs
obtained from the segmentation of a single dendrite (represented on the
left). Scale bar, 10 µm. (D) Dendritic confined event frequency
Subsequently, VIP ArchT and PV ArchT mice,
and their respective YFP controls, were subjected
to a fear-conditioning discriminative
task, during which mice progressively learned
to discriminate a negative (safety, CS−) from
a positive (danger, CS+) conditional auditory
stimulus (CS) (materials and methods). Mice
showed no preference for either of the two
conditional auditory stimuli (CS+ and CS−)
during initial habituation (day 1) (fig. S10).
After conditioning (day 2), all mice showed a
significant increase of freezing in response
to the CS+ (day 1 versus day 3) (fig. S10).
Immediately after the behavioral conditioning
session (day 2), prefrontal ArchT-expressing 
VIP or PV neurons, and their respective YFP 
controls, were optogenetically silenced bilat- 
erally selectively during every REM sleep epi- 
sode using online detection of EEG and EMG 
signals over a 4-hour time window (532-nm 
continuous light) (Fig. 5, A and B). REM 
sleep-specific silencing of VIP or PV neurons 
did not perturb sleep-wake cycle architecture 
(Fig. 5C) or the probability of state tran- 
sitions or sleep episode duration (fig. S11). 
The next day (day 3), mice were placed in a
different context and exposed to CS+ and
CS− without any aversive stimuli to retest
for discriminative memory retention (Fig. 5B
and fig. S10). In contrast to the YFP control
mice, VIP Arch mice subjected to REM sleep-specific
optogenetic silencing lost
the ability to discriminate CS− from CS+
(Fig. 5D) and showed a generalized freezing
behavior (Fig. 5F), whereas optogenetic silencing
in PV Arch mice led to a significant
increase of CS−-versus-CS+ discrimination
(Fig. 5, E and F, and see fig. S10 for raw
values). Accordingly, fear extinction was lower
in PV Arch compared with control mice
(fig. S12).
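
The closed-loop protocol described above delivers light only during REM sleep episodes detected online from EEG and EMG signals. As a rough illustration, the sketch below gates the light on a common heuristic: a high theta-to-delta EEG power ratio together with low EMG activity. The heuristic itself, the threshold values, and all function and field names are assumptions made for illustration. The actual detection criteria are given in the materials and methods and are not reproduced here.

```python
def is_rem(theta_power, delta_power, emg_rms,
           ratio_threshold=2.0, emg_threshold=0.1):
    """Classify one epoch as REM sleep (hypothetical thresholds).

    REM-like epochs show theta-dominated EEG and muscle atonia, so both
    conditions must hold at the same time.
    """
    if delta_power <= 0:
        return False
    theta_dominant = theta_power / delta_power > ratio_threshold
    muscle_atonia = emg_rms < emg_threshold
    return theta_dominant and muscle_atonia


def light_schedule(epochs, **thresholds):
    """Return one boolean per epoch: True means the light is switched on."""
    return [
        is_rem(e["theta"], e["delta"], e["emg"], **thresholds)
        for e in epochs
    ]


# Toy epochs with precomputed band powers (arbitrary units, not real data).
epochs = [
    {"theta": 1.0, "delta": 2.0, "emg": 0.05},  # NREM-like: delta dominant
    {"theta": 4.0, "delta": 1.0, "emg": 0.02},  # REM-like: theta, low EMG
    {"theta": 4.0, "delta": 1.0, "emg": 0.50},  # wake-like: theta, high EMG
]
schedule = light_schedule(epochs)
print(schedule)
```

Requiring both conditions keeps active wakefulness, which can also be theta rich, from triggering the light.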

Finally, we tested whether optogenetic si- 
lencing of ArchT-expressing PV or VIP neu- 
rons affects synaptic plasticity within local 
prefrontal circuits using ex vivo acute patch- 
clamp recordings after memory recall and 
REM sleep-specific optogenetic silencing 
(day 3) (Fig. 5, B and G). Consistent with the 
behavioral results, PFC PYR neurons dis- 
played a significant decrease of AMPA/NMDA 
(N-methyl-D-aspartate) ratio after REM sleep-
silencing of ArchT-expressing VIP neurons, 
which suggests a suppression of local synaptic 
plasticity (Fig. 5H). By contrast, they exhibited 
a significant enhancement of AMPA/NMDA 
ratio after silencing of ArchT-expressing 
PV neurons as compared with YFP control, 
suggesting an increase of local synaptic plasticity
(Fig. 5I). Correlation analysis between
AMPA/NMDA ratio and discrimination score 
further showed a segregation between VIP 
Arch mice (low AMPA/NMDA ratio and dis- 
crimination values), PV Arch mice (high AMPA/ 
NMDA ratio and discrimination values), and 
their respective PV-YFP and VIP-YFP control
groups (Fig. 5J). 


Discussion 


Collectively, these results demonstrate that the 
E/I balance within dPFC microcircuits is al- 
tered specifically during REM sleep through 
PV-mediated inhibition of PYR somas con- 
comitant to a VIP-dependent disinhibition of 
PYR dendritic activity, presumably because of 
the inhibition of SST neurons, as described 
ex vivo (32). Furthermore, the activity of PV 
neurons during REM sleep is controlled by 
inputs from a subset of mediodorsal thalamic 
neurons (33), as described in somatosensory 
(32, 34) and associative (33) thalamocortical 
circuits. This circuit model and the inputs to 
prefrontal interneurons may differ in deeper 
cortical layers and the rostrocaudal location 
of the input cell bodies, respectively, accord- 
ing to the neuromodulation associated with 
sleep-wake states and their transitions. 

A consequence of this somatodendritic de- 
coupling during REM sleep is the consolidation 
of emotional memories through a (paradoxical) 
increase of excitability of PYR dendrites con- 
comitant with a tonic inhibition of PYR somas, 
associated with a decrease of neuronal excit- 
ability. Although dendritic disinhibition during 
REM sleep provides a window to consolidate 
information acquired during wakefulness, 
somatic inhibition may preclude a top-down 
reinforcement of physiological and behavioral 
responses to fear in downstream pathways. 

A lack of somatic inhibition may interfere 
with local dendritic computation through back- 
propagating activity (35, 36) and facilitate non- 
linear integration of synaptic inputs during 
REM sleep. This may result in overconsolida- 
tion of emotional memories observed in post- 
traumatic stress disorders and other affective 
psychiatric and mood disorders often associated 
with REM sleep disturbances (37, 38). 

These findings are consistent with a simul- 
taneous upscaling and downscaling of synaptic 
plasticity at the PYR dendrites and cell bodies, 
respectively. Yet, whether similar NREM sleep- 
dependent mechanisms support memory con- 
solidation and synaptic plasticity remains to be 
investigated (39, 40). Somatodendritic decou- 
pling during REM sleep provides a mechanism 
that allows flexibility or adaptation to environ- 
mental changes and optimizes the discrimina- 
tion of danger versus safety and physiological 
responses to behavioral stressors. 
FERROELECTRICS


Highly enhanced ferroelectricity in HfO2-based 
ferroelectric thin film by light ion bombardment 


Seunghun Kang, Woo-Sung Jang, Anna N. Morozovska, Owoong Kwon, Yeongrok Jin,
Young-Hoon Kim, Hagyoul Bae, Chenxi Wang, Sang-Hyeok Yang, Alex Belianinov,
Sunkook Kim, Hu Young Jeong, Jaekwang Lee, Olga S. Ovchinnikova, Jinseong Heo,
Sergei V. Kalinin, Young-Min Kim, Yunseok Kim


Continuous advancement in nonvolatile and morphotropic beyond-Moore electronic devices requires
integration of ferroelectric and semiconductor materials. The emergence of hafnium oxide (HfO₂)-based
ferroelectrics that are compatible with atomic-layer deposition has opened interesting and promising
avenues of research. However, the origins of ferroelectricity and pathways to controlling it in HfO₂ are
still mysterious. We demonstrate that local helium (He) implantation can activate ferroelectricity in
these materials. The possible competing mechanisms, including He ion-induced molar volume changes,
vacancy redistribution, vacancy generation, and activation of vacancy mobility, are analyzed. These
findings both reveal the origins of ferroelectricity in this system and open pathways for nanoengineered
binary ferroelectrics.


Progress toward ultradense oxide electronics
and applications, including nonvolatile
memories and beyond-Moore
devices, necessitates the introduction
of improved device paradigms and the
development of strategies that utilize the
wealth of functionalities of complex materials
at extremely reduced dimensions. These include
a wide range of emergent properties,
such as magnetoelectricity, ferroelectricity,
flexoelectricity, and electrochromicity, that
result from the intrinsic competition (or cooperation)
of charge, spin, orbital, and lattice
degrees of freedom (1). Ferroelectricity plays
a special role in switching, logic, and memory
devices; it refers to a material
property of spontaneous electrical polarization
that is reversible when an electric field is
applied. Ferroelectricity has been studied for 
the application of next-generation electronic 
devices, such as ferroelectric field-effect tran- 
sistors (2, 3) and nonvolatile memories (4, 5), 
for over 40 years owing to their desirable phys- 
ical properties, including a fast-switching speed 
and long-term stability (6). However, despite 
these superior properties, ferroelectrics have 
practical limitations owing to scaling effects 
(7, 8) and a low compatibility with comple- 
mentary metal-oxide semiconductor (CMOS) 
processes (9). In particular, the ferroelectricity 
of conventional perovskite ferroelectrics sub- 
stantially drops at thicknesses below 50 nm, 
limiting their application in ultralarge-scale 
integrated electronics (7). Second, integration 
between classical ferroelectrics and Si and 
other semiconductors is limited by intrinsi- 
cally high oxygen chemical potential in the 
former, leading to formation of parasitic SiO₂
layers and degradation during classical pro- 
cessing. Hence, despite much effort, the goal 
of ferroelectric-semiconductor integration has 
remained elusive. 

The discovery of HfO₂-based ferroelectrics
with a fluorite structure in 2011 (10), e.g.,
Hf₀.₅Zr₀.₅O₂ (HZO), broke the traditional
paradigm and offers the promise of nonvolatile
and morphotropic beyond-Moore
electronic devices (10-13). Unlike conventional
ferroelectrics, HfO₂-based ferroelectrics maintain
robust ferroelectricity (14), which would
be viable even down to a fundamental barrier
restricted by a unit cell dimension, and are
highly compatible with the existing CMOS
processes (15). However, the nature of
ferroelectricity in these materials remains poorly
understood owing to the complexity that 
arises from undefined structural and chemical 


factors. This poor understanding has in turn
hindered the rapid development of facile
controllability and large-scale uniformity. From a
phenomenological perspective, HfO₂-based
ferroelectrics demand a “wake-up” process to increase
ferroelectricity by electric field cycling (16-20),
suggesting the nonconventional nature of
ferroelectric responses. Furthermore, deposition
parameters, such as doping, oxygen partial
pressure, and annealing conditions (21-24),
and, from an engineering perspective, surface
modifications such as plasma treatment (25, 26),
were found to have a strong effect on the
resulting ferroelectricity.

Ferroelectricity in HfO₂-based ferroelectrics
can originate from multiple mechanisms, such
as intrinsic ferroelectricity (27), electret-like
responses associated with charge trapping at
internal interfaces and intrinsic electrostriction
(28). Moreover, recent studies have illustrated
the important role of oxygen vacancies (29). This
has gained particular attention in HfO₂-based
ferroelectrics because the vacancies stabilize
the metastable ferroelectric orthorhombic
(Pca2₁) phase over the entire polycrystalline
HZO thin film, resulting in an enhancement
of ferroelectricity (23, 30). Oxygen vacancy
redistribution has been suggested as a possible
mechanism for enhancement of the
ferroelectricity driven by the wake-up effect (16-19).
Despite these findings, controlling the
ferroelectric response in HfO₂-based ferroelectrics
has remained elusive.

We propose a strategy for highly enhanced 
and precisely controlled ferroelectricity in 
HfO₂-based thin films by point defect engineering
through light-ion bombardment. We
also report possible competing mechanisms 
that include molar volume changes, vacancy 
distribution, number of vacancies, and vacancy 
mobility. We provide evidence for vacancy 
generation as the dominant mechanism be- 
hind the emergence of ferroelectric behavior 
in these materials. 

To quantitatively evaluate the effective en- 
hancement of ferroelectricity after ion beam 
treatment, we explore the remnant polariza- 
tion states and the hysteretic behavior via 
piezoresponse force microscopy (PFM) com- 
bined with various advanced techniques, such 
as the band excitation (BE) method and laser 
Doppler vibrometer (LDV). To understand the 
origin of the enhanced ferroelectricity, we ex- 
amined the atomic structure-chemistry rela- 
tionship in the polycrystalline HZO thin films 
before and after the light-ion treatment based 
on scanning transmission electron microscopy 
(STEM) combined with energy dispersive x-ray 
spectroscopy (EDX) and electron energy loss 
spectroscopy (EELS). We revealed that oxygen 
vacancies are homogeneously distributed over 
the optimally treated HZO thin film without 
the intervention of structural defects, which 
eventually leads to a global phase transition 
Fig. 1. Design of experiments and spontaneous polarization properties.
(A) Schematic representation of ion bombardment process. (B) Scheme for dose
of He ion bombardment; inserted texts indicate the He ion doses (ions/cm²).
(C) R-PFM amplitude image of HZO thin film after ion bombardment; green-
dotted box in (B) indicates the measurement position of (C). (D) R-PFM
amplitudes as a function of He ion dose extracted from (C) except for the central
part of 10¹⁸ ions/cm². (E) R-PFM, (F) BE-PFM, and (G) LDV-PFM amplitude
images for 10¹⁷ ions/cm² corresponding to the orange dotted box in (C). The
PFM image is rotated 90° counterclockwise to align according to (B) and (C).


(H) Spatial map of effective d33 after ion bombardment in (B); the orange and 
blue dashed boxes serve as visual guides corresponding to the positions with the 
He ion dose of 5 x 10" to 5 x 10" and 10° to 10” ions/cm”, respectively. 

(I) BE-PFM amplitude versus Vac depending on the He ion dose; BE-PFM
amplitude was calibrated via the force-distance curve-based static calibration
method (55). (J) Effective d33 as a function of He ion dose; the green line
and area indicate the mean ± SD of the effective d33 in the pristine
region, respectively. The effective d33 results of (H) and (J) were extracted using
least-squares linear fitting, except for the noise floor region.


to the ferroelectric orthorhombic phase whereas 
the nontreated or unoptimized thin films pre- 
sent a mixture of nonferroelectric and ferro- 
electric phases. This result implies a strong 
coupling of the oxygen vacancy and the re- 
sulting ferroic behavior of the HZO thin film. 
Thermodynamical interpretation can elucidate 
the phenomenological observation as possible 
competing mechanisms, including molar vol- 
ume changes, vacancy distribution, number of 
vacancies, and vacancy mobility. The method 
that we propose to modulate the density of the 
point defects could be a breakthrough for 
the practical application of ferroelectrics to 
next-generation memory devices via enhanced 
ferroelectricity. 
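
The Fig. 1 caption states that the effective d33 was extracted by least-squares linear fitting with the noise floor region excluded. A minimal sketch of that idea follows, assuming that the calibrated piezoresponse amplitude grows linearly with the drive voltage Vac above the noise floor, so that the fitted slope serves as the effective d33. The function names, the pm/V units, and the toy sweep values are hypothetical. The 0.3 Vac cutoff mirrors the background-noise limit mentioned in the text, but its use as a fitting cutoff here is an assumption.

```python
def effective_d33(v_ac, amplitude_pm, noise_floor_v=0.3):
    """Least-squares slope (pm/V) of amplitude versus drive voltage.

    Points below `noise_floor_v` are dropped before fitting because the
    signal there is dominated by background noise.
    """
    points = [(v, a) for v, a in zip(v_ac, amplitude_pm) if v >= noise_floor_v]
    if len(points) < 2:
        raise ValueError("need at least two points above the noise floor")
    n = len(points)
    mean_v = sum(v for v, _ in points) / n
    mean_a = sum(a for _, a in points) / n
    sxx = sum((v - mean_v) ** 2 for v, _ in points)
    if sxx == 0:
        raise ValueError("drive voltages must not all be identical")
    sxy = sum((v - mean_v) * (a - mean_a) for v, a in points)
    return sxy / sxx


def relative_increase_percent(treated, pristine):
    """Percentage change of the treated value relative to the pristine value."""
    return 100.0 * (treated - pristine) / pristine


# Toy amplitude sweeps (not measured data); the first two points sit on a
# flat noise floor and are excluded from the fit.
v_ac = [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
pristine_pm = [0.5, 0.5, 0.6, 0.8, 1.0, 1.2]
treated_pm = [0.5, 0.5, 1.2, 1.6, 2.0, 2.4]

d33_pristine = effective_d33(v_ac, pristine_pm)
d33_treated = effective_d33(v_ac, treated_pm)
increase = relative_increase_percent(d33_treated, d33_pristine)
print(d33_pristine, d33_treated, increase)
```

Fitting a slope with a free intercept, rather than dividing amplitude by voltage point by point, keeps a constant offset in the amplitude from biasing the estimate.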
Piezoresponse-dependent light ion
bombardment

We chose He ion bombardment as a strategy 
to explore the emergence of ferroelectricity, 
where He ions can become trapped by the 
lattice, leading to an increase in the molar 
volume, as well as generating oxygen vacan- 
cies. We performed ion bombardment on the 
HZO thin film as a function of the He ion 
dose using focused-beam He ion microscopy 
(Fig. 1A). To observe the change in the electric 
polarization as a function of defect density, we 
bombarded the thin film with various He ion 
doses (Fig. 1B). We determined the changes in 
the local polarization associated with the He 
ion dose using resonance PFM (R-PFM), which 


was measured with an AC voltage of a single 
frequency near the contact resonance (Fig. 1, 
C and D) (37). 

The regions with He ion doses below 10¹⁵
ions/cm² do not demonstrate noticeable changes
in the R-PFM amplitude, which represents the 
magnitude of the piezoresponse signal that in 
turn is related to polarization. Instead, these 
regions present a slight decrease in the piezo- 
response. This reduction may result from the 
defect formation by ion bombardment. How- 
ever, it may be also associated with carbon 
contamination at the surface because carbon 
deposition can be easily induced by the surface 
adsorbates of hydrocarbon molecules during 
ion bombardment (32-34) and, furthermore, 
Fig. 2. Switchable polarization properties. (A) FORC-PFM hysteresis loop
depending on the He ion dose. (B) Positive and (C) negative remnant 
piezoresponse versus Vac,max depending on the He ion dose. (D) Average 
piezoresponse hysteresis loop and the fitting result by the Preisach 


the atomic force microscopy (AFM) measurements
can be sensitive to carbon contamination.
However, the regions with He ion doses
above 10¹⁵ ions/cm² show an increase in the
R-PFM amplitude with an increase in the 
He ion dose. In particular, the R-PFM amplitude
in the region with the He ion dose of
10¹⁷ ions/cm² increased by approximately
two times compared with the pristine region,
indicating substantial enhancement
of ferroelectricity. Meanwhile, the central
part of the region with the He ion dose of
10¹⁸ ions/cm² shows a rapid decrease in the
R-PFM amplitude and the milled surface (fig.
S1) because an excessive He ion dose can lead
to a crystallographic collapse via material
sputtering.

R-PFM can be affected by other contribu- 
tions, such as a resonance shift or nonferro- 
electric contributions (35-38), so we measured 
BE-PFM and LDV-PFM (Fig. 1, F and G) in 
addition to R-PFM (Fig. 1E). Because BE-PFM 
can track the resonance frequency, it allows 
one to minimize the error from the resonance 
shift (39), which can be associated with carbon 
contamination. Similar to the R-PFM results, 
BE-PFM amplitude also indicates an increase 
in the piezoresponse after ion bombardment. 
In addition, we confirmed that there was little 
change of mechanical properties in the ion- 
bombarded HZO thin film through BE-PFM 
(fig. S2). Meanwhile, recent studies have reported
that nonferroelectric factors, such as
the electrostatic effect, can contribute to
the He ion dose.


the PFM signal (35, 36). The difference in the 
surface potential according to the ion bom- 
bardment may affect the PFM signal due to 
electrostatic effects (40). 

Therefore, we conducted LDV-PFM mea- 
surements in the same region (Fig. 1G) to 
evaluate the PFM signal, excluding the long- 
range electrostatic effect because LDV-PFM is 
less sensitive to the electrostatic effect (40, 41). 
We observed a higher PFM amplitude in the 
region with a He ion dose of 10¹⁷ ions/cm².
Furthermore, although the surface potential 
in the ion-bombarded region relaxed to the 
same value as that in the pristine region after 
a sufficient time elapsed, the R-PFM ampli- 
tude still exhibited a strong response in the 
ion-bombarded region (see fig. S3, 10¹⁷ ions/cm²),
indicating that the highly increased PFM
signal originated primarily from the enhanced 
polarization by the ion bombardment. In ad- 
dition, to verify the reproducibility of the thin 
films deposited by another method, we mea- 
sured R-PFM images after ion bombardment 
on the HZO thin film deposited by the atomic 
layer deposition (fig. S4). We observed that 
light-ion bombardment continues to enhance 
the polarization in the HZO thin film, regard- 
less of the deposition method used. Overall, 
the R-PFM results indicate that ion bombard- 
ment can be used to control the polarization 
by controlling the He ion dose. 

We measured the effective d33 to explore 
the change in the polarization of HZO after 
ion bombardment, which may indicate the 
out-of-plane piezoelectric constant of spontaneous
polarization, through an alternating-current
voltage (Vac) amplitude sweep in the
HZO thin film. We determined the variation
of the effective d33 obtained from the mapping
result of the Vac amplitude sweep, i.e.,
the BE-PFM amplitude according to Vac, after
ion bombardment (Fig. 1H). The regions that
we measured correspond to the regions of 5 ×
10¹⁴ to 10¹⁷ ions/cm² (Fig. 1, I and J). We
excluded the regions of 5 × 10¹⁷ to 10¹⁸ ions/cm²
beam damage (fig. S1). The regions with He 
ion doses of 5 × 10¹⁵ to 10¹⁷ ions/cm² clearly
have a higher effective d33 compared with
that of the other regions (Fig. 1, H and I),
although the difference according to the 
He ion dose was not observed owing to the 
equipment background noise below 0.3 Vac. 
We averaged the effective d33 results for a 
quantitative comparison as a function of the 
He ion dose (Fig. 1J). The regions with He ion 
doses below 10° ions/cm? show a slight de- 
crease in the effective d33. This decrease may 
be due to the defect formation by ion bom- 
bardment or the carbon contamination at the 
surface of the HZO thin film after ion bom- 
bardment (32-34). When the He ion dose is 
increased to 5 x10!° ions/cm? or more, the 
effective d33 is substantially increased com- 
pared with that of the pristine region. In 
particular, the effective d33 value of the
10” ions/cm? region increased by 99% compared 
with that of the pristine region. Therefore, 

Fig. 3. Chemical and structural analysis of crystal phases and defects.
(A) STEM-EDX chemical maps of the constituent elements Hf, Zr, O, and Si in the
HZO samples with different He ion dose conditions: P-HZO, L-HZO, and H-HZO,
respectively. For elemental mapping, characteristic x-ray peaks of Hf L (7.898 keV),
Zr K (15.744 keV), O K (0.525 keV), and Si K (1.739 keV) were selected. The
weak Si contrast shown in the HZO thin film is attributed to the peak overlap
of Si K and Hf M (1.644 keV) peaks. Scale bars, 10 nm. (B) Representative
HAADF images of the three HZO samples. One of the nanograins in the three
samples was aligned to a crystal zone-axis for atomic resolution imaging.
(C) (Left) Enlarged atomic structure images for the regions marked by white boxes
in (B); (right) simulated HAADF images corresponding to the experimental images.
The Hf and Zr lattice models used for the HAADF image simulations were overlaid
on each image for comparison. The aligned nanograins in the three HZO samples
were identified to be [001]-oriented monoclinic and [110]-oriented orthorhombic
structures, respectively. (D) Comparison between the experimental and the
simulated PACBED patterns obtained from each nanograin in the three HZO
samples. The simulated PACBED patterns were calculated from the three identified
atomic structures. (E) Maps of a/b ratio in O K EELS for the three HZO
samples. The maps were reconstructed by NLLS fitting process for the obtained
EELS SI datasets. Insets represent O K EELS spectra averaged over the HZO
film regions, respectively. The outside regions shaded with dark gray in each HZO
sample were not considered for the evaluation of the a/b ratio because those
regions are unrelated to the HZO film. The color scale bar on the rightmost side
indicates the range of the measured a/b ratio in O K edge.


the PFM results indicate that light-ion bom- 
bardment can enhance the polarization of 
HZO; furthermore, it can be used to locally 
control these enhancements. However, considering
the applicability of ferroelectrics to
next-generation electronic devices, the switching
characteristics of the polarization are more
important than the spontaneous polarization itself.
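
The effective d33 discussed above is, in essence, the slope of the piezoresponse amplitude against the ac drive voltage. A minimal Python sketch of such an estimate follows. The function name, the synthetic amplitudes, and the choice of a straight-line fit restricted to drives above the 0.3 Vac noise floor are illustrative assumptions, not the authors' analysis code.

```python
import numpy as np


def effective_d33(v_ac, amplitude_pm, noise_floor_v=0.3):
    """Return the slope (pm/V) of amplitude vs. ac drive, ignoring points below the noise floor."""
    v_ac = np.asarray(v_ac, dtype=float)
    amplitude_pm = np.asarray(amplitude_pm, dtype=float)
    usable = v_ac >= noise_floor_v  # drop points dominated by background noise
    if usable.sum() < 2:
        raise ValueError("need at least two points above the noise floor")
    slope, _intercept = np.polyfit(v_ac[usable], amplitude_pm[usable], 1)
    return float(slope)


# Hypothetical sweep data (pm) for a pristine and a bombarded region.
v_sweep = np.linspace(0.0, 1.0, 11)
pristine_pm = 2.00 * v_sweep + 0.1
bombarded_pm = 3.98 * v_sweep + 0.1

d33_pristine = effective_d33(v_sweep, pristine_pm)
d33_bombarded = effective_d33(v_sweep, bombarded_pm)
relative_gain = d33_bombarded / d33_pristine - 1.0  # fractional increase over pristine
```

With these made-up slopes the fractional increase mirrors the ~99% figure quoted in the text; real data would additionally need averaging over the mapped pixels of each dose region.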

Influence of light-ion bombardment on
polarization switching 

To observe the changes in the polarization
switching characteristics owing to He ion
bombardment, we measured PFM hysteresis
loops based on the first-order reversal-curve
(FORC) method (42). We measured
the polarization switching behavior depending
on the maximum direct-current voltages
(Vdc,max) as a function of the He ion dose
(Fig. 2A). Whereas the coercive voltages related
to the polarization switching were in a
similar range regardless of the ion bombardment
and He ion dose, the piezoresponse
indicating the intensity and direction of
polarization was substantially increased at

Fig. 4. [Caption partially lost in extraction.] (A) [...] different Nd0 = (2, 1.5, 1, 0.5)·N0 (black, red, blue, and green curves, [...]),
and hd is the decay factor of the defect concentration. (B) [...]
vacancy concentration Nd calculated for different film thicknesses [...]
and green curves, respectively). Parameters: T = 300 K, [...]
[parameter values for N0, W, and q33 not recoverable] (C) [...]
image of HZO thin film with top electrodes. Red and blue circles indicate the points [...]
measurements by AFM-PUND were performed for the pristine and bombarded regions, respectively. The
slightly larger height with rectangular shape indicates the region bombarded with 10¹⁷ ions/cm². (D) P-V
curves of the pristine (red) and bombarded (blue) regions. In the case of the pristine region, P-V curve 
measurements were performed before He ion bombardment. 


He ion doses above 10'° ions/cm. These re- 
sults can be more clearly observed through 
the remnant piezoresponse (Fig. 2, B and C). 
The positive remnant piezoresponse increases 
more steeply in the He ion doses of 5 x 10'° 
and 10” ions/cm? than that of other doses 
with an increasing Vdc,max as the polarization
direction shifts downward in the HZO
thin film, except for the result at 1 Vdc,max,
where switching does not occur (Fig. 2B). In
the region with a He ion dose of 10” ions/ 
cm’, the positive remnant piezoresponse at 
8 Vdc,max was increased by ~42% compared
with that in the pristine region. The ion 
bombardment in the present case does not 
substantially affect leakage current (fig. S5). 
The piezoresponse was slightly decreased at 
He ion doses below 10’? ions/cm”, indicating 
that the slight decrease in both R-PFM ampli- 
tude and remnant piezoresponse may have 
resulted from the defect formation by ion
bombardment as well, rather than the carbon 
contamination. 

The PFM hysteresis loops that we obtained 
by the FORC method were well-fitted by the 
generalized Preisach model (Fig. 2D) (37). Our 
analysis suggests that He ion bombardment 
yields a broader distribution of coercive fields
than the internal bias fields for the average 
result, suggesting the nonuniform nature of 
the material. However, the distributions of the 
Preisach density for the He ion dose of 5 x 10'° 
to 10” ions/cm” are clearly distinguished from 
those for the other doses by the narrower dis- 
tribution of the coercive voltages and higher 
Preisach density than the coercive fields (Fig. 
2E), indicating more uniform and strengthened 
polarization switching responses. In addition, 
because of the increasing defect density, the 
interaction between charged defects and the
spontaneous polarization is enhanced, leading
to an extended distribution of internal bias 
voltages for the He ion dose of 5 x 10°° to 10”” 
ions/cm”. Apparently, a threshold value exists 
for the He ion dose to induce strengthened 
polarization switching and strong interactions. 
The ratio of the internal bias voltage width to 
the coercive voltage width and the irreversible 
amplitude both show an abrupt increase
between the He ion doses of 10°° and 5 x 10'° 
ions/cm? (Fig. 2F). 
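
To make the Preisach picture above concrete, the following Python sketch builds a hysteresis loop from a population of square hysterons, each characterised by a coercive voltage and an internal bias voltage. It is a classical Preisach illustration only: the Gaussian distributions, their parameters, and the function name are assumptions chosen for demonstration, and it does not reproduce the generalized Preisach fit used for Fig. 2D.

```python
import numpy as np


def preisach_loop(voltages, v_coercive, v_bias, weights):
    """Response of square hysterons: each switches up at v_bias + v_coercive
    and down at v_bias - v_coercive."""
    v_coercive = np.asarray(v_coercive, dtype=float)
    v_bias = np.asarray(v_bias, dtype=float)
    weights = np.asarray(weights, dtype=float)
    up = v_bias + v_coercive
    down = v_bias - v_coercive
    state = -np.ones_like(v_coercive)  # start with every hysteron poled down
    response = []
    for v in voltages:
        state = np.where(v >= up, 1.0, state)
        state = np.where(v <= down, -1.0, state)
        response.append(float(np.sum(weights * state)))
    return np.array(response)


# Hypothetical Preisach density: Gaussian spread in coercive and bias voltages.
rng = np.random.default_rng(0)
n_hysterons = 2000
v_c = np.abs(rng.normal(3.0, 0.5, n_hysterons))
v_b = rng.normal(0.0, 0.3, n_hysterons)
w = np.full(n_hysterons, 1.0 / n_hysterons)

# One full sweep: -8 V -> +8 V -> -8 V.
sweep = np.concatenate([np.linspace(-8.0, 8.0, 81), np.linspace(8.0, -8.0, 81)])
loop = preisach_loop(sweep, v_c, v_b, w)
```

Narrowing the spread of `v_c` sharpens the switching step of the loop, whereas widening the spread of `v_b` smears it along the voltage axis, which is the qualitative distinction the Preisach density maps are used to draw.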

Overall, these results indicate the possibil- 
ity of using light-ion bombardment to highly 
enhance the ferroelectric polarization in HZO 
through defect engineering. Furthermore, an 
increase in the number of defects can occur 
owing to light-ion bombardment (figs. S6 and 
S7). Presumably, the enhanced ferroelectricity
of the HZO thin film could be ascribed to the 
increased defect density inside the HZO thin 
film by the posttreatment of the He ion bom- 
bardment. Regardless, because the enhance- 
ment of ferroelectric polarization through 
defect engineering can be associated with 
several mechanisms, such as phase transition 
to the ferroelectric phase (16), we explored 
the origin of the ferroelectric polarization 
enhancement. 


Mechanism of highly enhanced ferroelectricity 
based on defect engineering 


To determine the dominant defect formed 
and the cause of the enhanced ferroelectricity 
in the treated HZO thin film, we performed a 
chemical analysis on the HZO samples with 
He ion doses of 5 x 10” and 10” ions/cm”, 
respectively, and compared those with the 
case of the pristine HZO thin film. The pristine 
HZO and the two HZO thin films with dif- 
ferent He ion doses are denoted as pristine 
(P-HZO), 5 x 10° ions/em? (L-HZO), and 
10¹⁷ ions/cm² (H-HZO), respectively. We
constructed EDX elemental maps of Hf (red), 
Zr (yellow), O (green), and Si (cyan) for the 
three HZO samples of P-HZO, L-HZO, and 
H-HZO, respectively (Fig. 3A). The annular 
dark field (ADF) STEM images and the cor- 
responding EDX maps confirm that the thick- 
nesses of the HZO thin films are ~10 nm but 
appear to become larger with an increasing 
dose of He ions, which indicates an increase in 
molar volume. Comparing the high-resolution 
high-angle (HA)ADF STEM images of the 
three HZO samples (Fig. 3B), we observed 
layer thickening and structural disorder (seen 
as the increased fuzzy contrast in the image) in 
the H-HZO sample. This characteristic volume 
change is also apparent at the atomic scale in 
the comparison of planar spacings of the 
orthorhombic phase observed in the three 
HZO samples (fig. S8). The EDX maps of the 
three samples revealed that Hf and Zr are 
sufficiently mixed without elemental segregation, 
and the distribution of oxygen is wider than 
that of Hf and Zr, extending into the Si substrate.
This result indicates that a portion
of oxygen in the HZO thin film was consumed
to form a thin SiO₂ bottom layer on the Si
substrate due to the interface reaction during
the growth. Thus, the bottom part of the HZO
layer would be oxygen deficient as compared
to the upper part.

To clarify the structural transition of the 
HZO thin films to the ferroelectric ortho- 
rhombic phase by He ion bombardment, we 
obtained atomic-scale HAADF STEM images 
for the three HZO samples and compared 
them with the simulated counterparts (Fig. 3C). 
Because the signal contrast in the HAADF im- 
aging mode is approximately scaled to the 
atomic number squared (Z²), heavy Hf atoms
(Z = 72) appear as bright dots, whereas light 
oxygen atoms (Z = 8) are hardly observed (43). 
Owing to the polycrystalline nature of the 
grown HZO thin films, we only observed the 
atomic structures of the thin films for nano- 
grains aligned along an arbitrary crystal zone 
axis. Hence, we conducted multiple STEM 
observations for each sample to determine 
the polymorphic phase distribution over the 
film of investigation. As a result, the P-HZO 
and L-HZO thin films present a mixture of 
monoclinic and orthorhombic phases, main- 
taining the monoclinic structure as the dom- 
inant phase. We identified the two atomic 
structures as orthorhombic and monoclinic 
phases observed in both HZO thin films (figs. S9, 
A and B). By contrast, we identified the nanograins
in the H-HZO thin film as the ferroelectric
orthorhombic (Pca2₁) phase in all the
observations (fig. S9C). We collected an extra 
dataset confirming this behavior (fig. S10). 
Interphase boundaries between orthorhombic 
and monoclinic domains were occasionally 
spotted in P-HZO films (fig. S11A) but were
seldom observed in the orthorhombic grains 
in H-HZO films. Statistical size measure- 
ments indicate that the average grain size 
was slightly increased by ~2 nm after the He 
ion bombardment-induced transition to ortho- 
rhombic phase (fig. S11B). The HAADF STEM 
images of the P- and L-HZO samples show the 
[001]-oriented monoclinic structure, whereas 
that of the H-HZO samples represents the 
[110]-oriented orthorhombic structure (Fig. 3C). 
Position-averaged convergent beam electron 
diffraction (PACBED) analysis (44) was cor- 
respondingly performed on the regions of 
STEM observations to confirm those struc- 
tures. Consistently, the experimental PACBED 
patterns show a good match with the simu- 
lated PACBED patterns that were derived from 
the structures identified by the STEM observa- 
tions (Fig. 3D). Considering the multiple struc- 
tural observations, we conclude that the He ion 
bombardment into the pristine HZO thin film 
readily induces a homogeneous distribution of 
the ferroelectric orthorhombic phase over the 
film. These results suggest that atomic defects
such as oxygen vacancies and the structural 
disorder that were induced by He ion bom- 
bardment could play an important role in the 
favorable structural transition to the ferro- 
electric orthorhombic phase, thus resulting in 
the notable enhancement of ferroelectricity. 
To trace the distribution of oxygen vacan- 
cies inside the HZO thin films, we conducted 
an EELS spectrum imaging (SI) analysis of the 
O K edge for each HZO film (fig. S12). From the 
EELS SI datasets, we extracted the averaged 
O K spectra for the three HZO thin films (see 
insets in Fig. 3E). The O K energy-loss near- 
edge structure (ELNES) of HZO presents a 
notable doublet-peak shape denoted as a (peak 
at ~534 eV) and b (peak at ~538 eV) with an 
energy difference (ΔE) of ~4 eV, which is
attributed to the O 2p orbital hybridized with 
the Hf (or Zr) d(eg) and d(t2g) orbitals, respectively
(45). This characteristic peak splitting of the 
O K ELNES commonly appears in the HfO₂
polymorphs of monoclinic, tetragonal, and 
cubic phases with slight differences in the 
energy shift and peak broadening (45). Characteristically,
the peak sharpness and the intensity
of peak a relative to peak b vary with the
crystallinity and the content of oxygen vacancies
in the HfO₂ film. Poor crystallinity causes
a broadening of the double peak structure 
and also a decrease in the relative intensity of 
peak a, which should be considered when HZO 
thin films are grown at different annealing 
temperatures (46). Because the HZO samples 
were grown under the same thermal process- 
ing conditions, the most probable origin of 
the change in the O K ELNES could be the 
formation of the oxygen vacancies and/or 
the occurrence of structural disorder due to the 
He ion bombardment. The notable reduction 
in the a/b ratio from 0.60 to 0.43 in O K edges 
(Fig. 3E and fig. S12C) implies that the vacancy 
defects and structural disorder increased as the 
dose of He ion bombardment was increased 
to the optimal value of 10¹⁷ ions/cm² (H-HZO).
Because monoclinic and orthorhombic HZO 
structures have sevenfold coordinate metal 
ions, a similar broad doublet peak structure 
of the O K edges is observed (fig. S13A). However,
cubic and tetragonal HfO₂ structures with
sixfold coordinate metal ions would present 
substantially sharper double-peaked O K 
ELNES owing to the higher crystal field sym- 
metry (46). Previous studies reported that the 
formation of oxygen vacancies reduces the 
first peak (a) intensity of the O K doublet 
edge when other factors are held constant 
(46, 47). Our density functional theory-based 
O K-edge simulations for the influence of oxy- 
gen vacancies represent the same tendency 
(figs. S13, B and C, and S14) (37). 
Therefore, we examined the change in the 
relative intensity of peak a in the O K ELNES 
to detect the oxygen vacancies inside the HZO
thin films and mapped the a/b ratios corresponding
to the entire film region by nonlinear
least squares (NLLS) fitting analysis
(Fig. 3E). The resulting a/b ratio maps show 
that the oxygen vacancy content is gradually 
increased from P-HZO to H-HZO. Notably, 
the entire H-HZO thin film is revealed to be 
populated by oxygen vacancies, whereas the 
distribution of the oxygen vacancies is local- 
ized and nonuniform in the other two HZO 
samples (P-HZO and L-HZO); these results are 
similar to those previously obtained for a 
conventional HfO₂ sample (48). The statistical
histograms of the measured a/b ratios for
the three HZO samples clearly indicate the 
varying population behavior of oxygen vacan- 
cies from the P- and L-HZO to the H-HZO 
thin films (fig. S12C). The Gaussian curve fits 
of the a/b distributions for the three HZO 
samples demonstrate that the H-HZO thin 
film is highly oxygen deficient compared with 
the other two HZO samples. The a/b ratio can 
also be decreased to some extent as the crys- 
tallinity becomes poor (46). Given the increase 
in structural disorder by the He ion bombard- 
ment, it is also expected that the a/b ratio 
would be correspondingly further reduced, 
making quantification of the vacancy content 
by EELS difficult. 
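
The a/b ratio mapping described above amounts to fitting the two peaks of the O K doublet in every pixel and dividing their intensities. The Python sketch below illustrates the idea on synthetic spectra. It is a deliberate simplification: the peak centers (534 and 538 eV, taken from the text) and a common peak width (1.2 eV, a hypothetical value) are held fixed, so the fit reduces to linear least squares rather than the NLLS procedure the authors used, and no background subtraction is modelled.

```python
import numpy as np


def gaussian(energy_ev, center_ev, width_ev):
    """Unit-height Gaussian peak."""
    return np.exp(-0.5 * ((energy_ev - center_ev) / width_ev) ** 2)


def ab_ratio(energy_ev, spectrum, center_a=534.0, center_b=538.0, width_ev=1.2):
    """Fit two fixed-shape peaks and return the amplitude ratio a/b."""
    basis = np.column_stack([
        gaussian(energy_ev, center_a, width_ev),
        gaussian(energy_ev, center_b, width_ev),
    ])
    (amp_a, amp_b), *_ = np.linalg.lstsq(basis, spectrum, rcond=None)
    return float(amp_a / amp_b)


# Synthetic, noise-free spectra built with the two ratios quoted in the text.
energy = np.linspace(528.0, 545.0, 171)
pristine_like = 0.60 * gaussian(energy, 534.0, 1.2) + gaussian(energy, 538.0, 1.2)
bombarded_like = 0.43 * gaussian(energy, 534.0, 1.2) + gaussian(energy, 538.0, 1.2)

ratio_pristine = ab_ratio(energy, pristine_like)
ratio_bombarded = ab_ratio(energy, bombarded_like)
```

Because peak broadening from structural disorder also lowers the apparent a/b ratio, a fit with fixed widths such as this one would conflate disorder with vacancy content, which is the quantification difficulty noted in the text.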

Theoretical calculations predicted that the 
injection of oxygen vacancies and substitutional 
doping into the HfO₂ thin film can promote a
phase transition from the monoclinic (P2₁/c)
to the ferroelectric orthorhombic (Pca2₁) phase
(49, 50). However, this doping was presumably
insufficient to stabilize the metastable ferroelectric
phase over the predominant monoclinic
phase (51, 52). Experimental work demonstrated
that the role of oxygen vacancy is more
important in stabilizing the ferroelectric phase, 
prompting efforts for developing a facile route 
to controlling the concentration of the oxygen 
vacancy in the hafnia film (23, 30). On the basis 
of the STEM-EELS results, we confirmed that 
He ion bombardment into the HZO thin film 
efficiently triggers the formation of oxygen 
vacancy and increases its density and dis- 
tribution with an increasing He ion dose before 
causing structural damage (= ~10"* ions/cm”). 
Thus, to explore why the ferroelectric ortho- 
rhombic phase is stabilized as the oxygen 
vacancies are gradually introduced, we sur- 
veyed the change in total energy difference 
between the monoclinic and orthorhombic 
phases up to the vacancy content of 25% (fig. 
S13D) (31). Unexpectedly, we found that a 
threshold concentration (>20%) of oxygen 
vacancy is required to fully stabilize the 
ferroelectric orthorhombic phase. Annular 
bright-field STEM imaging confirmed that 
the oxygen-deficient ferroelectric orthorhombic 
structure is stably sustained (fig. S13E). A slight
reduction in both R-PFM amplitude and 
remnant piezoresponse at He ion doses below 
10"° ions/cm? may also be associated with the
threshold concentration. 

Nevertheless, because ferroelectric behavior 
can have multiple causes, we further analyze 
the possible thermodynamic mechanisms, in- 
cluding a direct change in molar volume, 
emergence of ferroelectricity, vacancy forma- 
tion, and vacancy redistribution. The free 
energy contribution Gp; describing the ferro- 
electric order-disorder-type transition can be 
expressed using the macroscopic polarization 
vector components (28): 
where PE denotes electrical polarization; the
polarization components are normalized as 
D= 5, Pa=Nad; d is the concentration- 
independent average value of an elementary 
dipole moment; the concentration Ng can 
be related with the average concentration 
(8Na(7r)) of different vacancies and/or other 
defects; J is the effective interaction constant 
that is negative for the considered case of 
improper ferroelectricity; J, is the effective 
nonlinearity coefficient; δ is the positive
gradient coefficient determined by the corre- 
lation between the dipole moments and by the 
contribution of the spatial gradients induced 
by the surface, as well as by the inhomogeneities
inherent to the film; and kB is the
Boltzmann constant. We consider that the film
surfaces are defect blocking, so all O, Hf, and 
Zr vacancies inside the film are time indepen- 
dent. Consequently, the average concentration 
of the vacancies is also time independent and 
equal to the average value calculated in the 
initial moment of time. Inserting elastic fields, 
created by different defects, into the (Hf,Zr)O₂
Gibbs potential and its subsequent expan- 
sion on the polarization powers leads to 
renormalization of the coefficient a by the 
electrostriction coupling with the Vegard 
expansion tensor, and the depolarization field 
effect [see (28, 31)]. Then, a local polar state 
can emerge for an out-of-plane polarization 
component p = o when the average con- 
centration of defects (Na) is high enough, 
and the positive terms proportional to λ/(hε0)
and δ/(λh) are small enough, where λ is the
effective screening length produced by the
exposed free surface and/or imperfect bottom
electrode; h is the (Hf,Zr)O₂ film thickness;
and ε0 is the universal dielectric constant.

We show typical dependences of the spontaneous
out-of-plane polarization p on the film
thickness and vacancy concentration Nd
(Fig. 4, A and B). The p value decreases rapidly
with increasing film thickness h (Fig. 4A). 
Further, the p value increases rapidly and then 
saturates with increasing vacancy concentra- 
tion Nz (Fig. 4B). The appearance of ferro- 
electricity is a typical size-induced effect because 
the second (positive) and third (negative) terms 
in eq. S10 for & = 3 are thickness dependent. 
In particular, the third term depends on h
as 1/h, because ⟨Nd⟩ = Nd0 (hd/h)(1 − exp(−h/hd)) in
accordance with eq. S9, where hd is the decay
factor of the defect concentration. Small second
and large third terms are required for the
negative ((of,)). Making A < 0.1 nm corre- 
sponds to a high screening degree by, e.g., 
either conducting electrodes or ambient free 
carriers, and using a small gradient coeffi- 
cient (i.e.,5 < 10-! C? m? J) makes the positive 
terms appear smaller than the negative one. 
Thus, the third term explains the size effect 
(Fig. 4, A and B). Assuming that the polar state 
occurs at Nz, = 107° to 107° m™?, we need to 
minimize exposure to He; Nye = 10“ ions/cm” 
can create such a defect concentration over 
the already available defect “background.” Our 
analysis suggests that the emergence of ferro- 
electricity in He ion-bombarded hafnia can be 
driven by a synergy of several factors. In ac- 
cordance with eq. S10, emergence of ferro- 
electricity via renormalization of ((a%,)) 
requires a dominant third term. This in turn 
requires high negative values of electrostriction 
coupling and large product (W%,)(Nq). Of 
these, the electrostriction coefficient g33 is of 
the same order for most ferroics. Hence, the 
product of the defect concentration and Vegard 
expansion coefficient (W<,) emerges as a pre- 
dominant factor controlling ferroelectricity in 
this material. Although we consider a quasi-equilibrium
static configuration, modeling
charged-defect transport in HfO₂-based ferroelectrics
in a nonequilibrium state could be
interesting future work for electrical cycling.
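
The thickness dependence invoked above can be checked numerically. The sketch below assumes, as an interpretation of the "decay factor" hd, that the defect concentration falls off exponentially from one surface, Nd(z) = Nd0 exp(−z/hd); averaging that profile over a film of thickness h gives ⟨Nd⟩ = Nd0 (hd/h)(1 − exp(−h/hd)). The profile shape, function names, and all numerical values are assumptions for illustration, not parameters from the paper.

```python
import numpy as np


def mean_defect_concentration(n_d0, h, h_d):
    """Closed-form thickness average of n_d0 * exp(-z / h_d) over 0 <= z <= h."""
    return n_d0 * (h_d / h) * (1.0 - np.exp(-h / h_d))


def mean_defect_concentration_numeric(n_d0, h, h_d, samples=200001):
    """Same average obtained by trapezoid-rule integration of the profile."""
    z = np.linspace(0.0, h, samples)
    profile = n_d0 * np.exp(-z / h_d)
    dz = z[1] - z[0]
    integral = dz * (profile.sum() - 0.5 * (profile[0] + profile[-1]))
    return integral / h


# Hypothetical values: surface concentration (m^-3), decay length and thicknesses (m).
n_d0 = 1.0e26
h_d = 5.0e-9
thicknesses = np.array([2.0e-9, 5.0e-9, 10.0e-9, 20.0e-9, 50.0e-9])

averages = mean_defect_concentration(n_d0, thicknesses, h_d)
check = mean_defect_concentration_numeric(n_d0, 10.0e-9, h_d)
```

The average approaches Nd0 for films much thinner than hd and falls off as hd/h for thick films, which is why the defect-driven term loses weight as the film thickness grows.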

Finally, although the STEM and theoretical 
results showed that polarization enhancement 
based on defect redistribution and phase tran- 
sition could be one of the possible mechanisms, 
electrical measurements are still necessary be- 
cause nonferroelectric factors such as electro- 
static effects might contribute to the obtained 
PFM signal (Figs. 1 and 2). Thus, we directly 
observed the remnant polarization charge of 
the electrode sample through an AFM-based 
positive-up-negative-down (i.e., AFM-PUND) 
method (53, 54). The remnant polarization 
charge density clearly increased after the He 
ion bombardment at a dose of 10” ions/cm? 
(Fig. 4, C and D). 
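
In a PUND sequence the switched polarization is commonly estimated by subtracting the current of the non-switching pulse (U) from that of the switching pulse (P), integrating the difference, and dividing by the electrode area. The Python sketch below shows this bookkeeping on synthetic transients. The pulse shapes, electrode area, and charge values are hypothetical, and the routine is a generic illustration rather than the AFM-PUND processing used by the authors.

```python
import numpy as np


def switched_polarization(time_s, current_p, current_u, area_m2):
    """Integrate (I_P - I_U) over time (trapezoid rule) and divide by the electrode area."""
    time_s = np.asarray(time_s, dtype=float)
    diff = np.asarray(current_p, dtype=float) - np.asarray(current_u, dtype=float)
    charge = np.sum(0.5 * (diff[1:] + diff[:-1]) * np.diff(time_s))
    return charge / area_m2  # C/m^2


# Hypothetical pulse: 10 us long, sampled at 2001 points.
t = np.linspace(0.0, 10e-6, 2001)
area = 1.0e-10       # hypothetical electrode area (m^2)
q_switch = 4.0e-11   # hypothetical switched charge (C)

background = 2.0e-6 * np.exp(-t / 1.0e-6)             # dielectric + leakage response
bump = np.exp(-0.5 * ((t - 3.0e-6) / 0.5e-6) ** 2)    # switching-current shape
bump *= q_switch / (np.sqrt(2.0 * np.pi) * 0.5e-6)    # normalise its area to q_switch

delta_p = switched_polarization(t, background + bump, background, area)
```

Because the dielectric and leakage contributions appear in both pulses, they cancel in the subtraction, which is what makes the method less sensitive to the nonferroelectric artifacts that can affect PFM signals.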


Conclusions 


We have demonstrated the high enhancement 
of ferroelectricity in HZO thin films based on 
defect engineering by He ion bombardment. 
Our PFM results indicate that the He ion
bombardment substantially enhanced the
ferroelectricity in the HZO thin film. In par- 
ticular, the resonance PFM results indicate 
that the amplitude of piezoresponse in the 
ion-bombarded region of 10’ ions/cm? in- 
creased by approximately twofold compared 
with that in the pristine region. This implies 
that the fine modulation of light-ion bombardment
can substantially enhance the polarization
of HZO. The STEM
and EELS results show that the large enhancement
of ferroelectricity by ion bombardment
can be caused by a homogeneous
distribution of oxygen vacancies and a phase
transition to the ferroelectric phase. In addition,
the theoretical calculations of the thermodynamic
mechanisms allow us to explore possible
mechanisms for the enhancement of the polarization
by ion bombardment. These results,
regarding the variation of ferroelectricity
through defect engineering based on ion
bombardment, suggest additional possibilities
for ferroelectricity enhancement in HfO₂-based
ferroelectrics, including HZO. Thus, He ion
bombardment can be a facile technique for
stabilizing the ferroelectric orthorhombic
phase. Furthermore, this study suggests that
highly enhanced ferroelectricity can be selec- 
tively obtained by controlling light-ion bombard- 
ment with a high spatial resolution. Ultimately, 
this approach can be directly applied to a 
semiconductor process without structural mod- 
ification and, thus, can increase its applicability 
in next-generation electronic devices, such as 
ultrascaled ferroelectrics-based transistors and 
memories. 

NANOCHEMISTRY


Ultrafast water permeation through nanochannels
with a densely fluorous interior surface


Yoshimitsu Itoh, Shuo Chen, Ryota Hirahara, Takeshi Konda, Tsubasa Aoki, Takumi Ueda,
Ichio Shimada, James J. Cannon, Cheng Shao, Junichiro Shiomi, Kazuhito V. Tabata,
Hiroyuki Noji, Kohei Sato, Takuzo Aida


Ultrafast water permeation in aquaporins is promoted by their hydrophobic interior surface. 
Polytetrafluoroethylene has a dense fluorine surface, leading to its strong water repellence. We report a 
series of fluorous oligoamide nanorings with interior diameters ranging from 0.9 to 1.9 nanometers. 
These nanorings undergo supramolecular polymerization in phospholipid bilayer membranes to form 
fluorous nanochannels, the interior walls of which are densely covered with fluorine atoms. The 
nanochannel with the smallest diameter exhibits a water permeation flux that is two orders of magnitude 
greater than those of aquaporins and carbon nanotubes. The proposed nanochannel exhibits
negligible chloride ion (Cl⁻) permeability caused by a powerful electrostatic barrier provided by the
electrostatically negative fluorous interior surface. Thus, this nanochannel is expected to show nearly
perfect salt reflectance for desalination.


The fluorous polymer polytetrafluoroethylene
(PTFE) provides a distinct superhydrophobic
surface on which densely packed
carbon-fluorine (C-F) bonds repel water
(2). Nevertheless, individual C-F bonds
are polar and can interact electrostatically
with polar functional groups to form hydrogen
bonds (H-bonds) (2). This notable bilateral
characteristic, known as polar hydrophobicity
(3), can be attributed to the properties of
fluorine, which include the largest electronegativity
and the second smallest atomic size
among all elements. The interaction between 
fluorous surfaces and water has been widely 
investigated (4). Recent reports based on Raman 
spectroscopy showed that water clusters in the 
vicinity of fluorous compounds break to yield 
many hydroxy dangling bonds, whereas con- 
siderably fewer dangling bonds are gener- 
ated with hydrocarbon analogs (5, 6). This 
observation indicates that a nanochannel with 
a PTFE-like superhydrophobic interior surface 
can suppress the formation of water clusters 
that likely diffuse more sluggishly than non- 
clustered water molecules. 

We developed a series of fluorous oligoamide
nanorings (FmNRns) that undergo supramolecular
polymerization to yield nanochannels
(FmNCns) with different interior diameters
(Fig. 1A). To design FmNRns and FmNCns, we
exploited the polar and hydrophobic natures
of C-F bonds. Because a C-F bond is highly
polar but is difficult to polarize atomically (3),
the C-F bond in FmNRns can interact electrostatically
with adjacent polar amide groups
to form H-bonds (2). This aspect renders the
macrocyclic backbone rigid and causes the
C-F bonds to point inward. When FmNRns
are supramolecularly polymerized
in hydrocarbon media to solvophobically form
fluorous nanochannels (FmNCns) (Fig. 1B),
their interior surfaces are densely covered
with fluorine atoms and thus can break water
clusters (5, 6).

We computationally investigated the in- 
fluence of the hydrophobicity of the interior 
nanochannel surface on the water clusters 
(7-9) (Fig. 1, C and E). In a molecular dynamics 
(MD) simulation based on a Lennard-Jones 
wall with different scaling factors, we changed 
the hydrophobicity of the interior surface 
(0) (Fig. 1, D and E). As expected from the 
behavior of the water clusters near the fluori- 
nated compounds (Fig. 1C), we observed the 
emergence of a larger number of hydroxy 
dangling bonds as the interior surface became 
more hydrophobic (Fig. 1D). Subsequently, we 
investigated the influence of hydrophobicity 
on the water flow rate. As shown in Fig. 1E, as 
the hydrophobicity of the wall increased, the 
flow rate also increased. 
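
One common way to tune the hydrophobicity of a model wall in MD is to multiply the well depth of the wall-water Lennard-Jones interaction by a scaling factor, so that a smaller factor means weaker attraction and a more hydrophobic surface. The Python sketch below illustrates that idea with the standard 12-6 potential. Whether the simulations described above scale the well depth in exactly this way is an assumption here, and the parameter values are hypothetical.

```python
import numpy as np


def wall_lj_energy(r, epsilon, sigma, scale=1.0):
    """12-6 Lennard-Jones energy with the well depth multiplied by `scale`."""
    sr6 = (sigma / np.asarray(r, dtype=float)) ** 6
    return 4.0 * scale * epsilon * (sr6 ** 2 - sr6)


sigma = 0.32     # nm, hypothetical wall-water size parameter
epsilon = 0.65   # kJ/mol, hypothetical unscaled well depth
r_min = 2.0 ** (1.0 / 6.0) * sigma  # separation at the potential minimum

# Energy at the minimum for progressively weaker (more hydrophobic) walls.
well_depths = {s: float(wall_lj_energy(r_min, epsilon, sigma, scale=s))
               for s in (1.0, 0.5, 0.25)}
```

The minimum stays at the same separation while its depth shrinks in proportion to the scaling factor, so the wall geometry seen by the water is unchanged and only the strength of the attraction varies.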

A series of fluorous nanorings, F12NR4,
F15NR5, F18NR6, and F12NR6 (Fig. 1A), were
synthesized through an amide condensation
reaction (10). Two starting compounds, 3,5-diamino-4-fluorobenzamide
derivative (1) and
2,3-difluoroterephthalic acid (2), were synthesized
according to previously reported
methods (10). Among the considered amide
condensation reagents, only pentafluorophenyl
diphenylphosphinate (FDPP) (11) allowed for
compounds 1 and 2 to be connected under
microwave irradiation. To isolate F12NR4, F15NR5,
and F18NR6 from the reaction mixtures, we
conducted reprecipitation, preparative recycling
silica gel column chromatography, and
gel permeation chromatography to remove
large amounts of linear oligomers. The fluorous
nanorings were characterized by NMR
spectroscopy (figs. S1 to S12) and high-resolution
matrix-assisted laser desorption/ionization
time of flight (MALDI-TOF) mass spectrometry
(figs. S16 to S18). We also succeeded in
synthesizing the partially fluorinated version
(F12NR6) using 3,5-diaminobenzamide
and 2 under conditions analogous to those
described above (figs. S13 to S15 and S19). The
nonfluorous version could not be obtained
no matter which optimized conditions were
employed.

The core structures of FmNRns were optimized
by density functional theory (DFT)
calculations (figs. S20, S22, S24, and S26).
The CPK representation of F12NR4 exhibited
a planar conformation (Fig. 2A), whereas the
other nanorings exhibited a skewed conformation
(Fig. 2, B to D). Although terephthalamide
units are rotatable, all the fluorine atoms
in the optimized structures were suggested
to point inward to form intramolecular
“C-F···H-N” H-bonds. The energy gains associated
with the intramolecular H-bonding
interaction for F12NR4, F15NR5, F18NR6, and
F12NR6 were calculated to be -36.2, -47.1, -44.4,
and -47.9 kcal mol⁻¹, respectively (figs. S21,

Fig. 1. Series of fluorous nanorings and formation of transmembrane fluorous nanochannels.
(A) Molecular structures of a series of fluorous nanorings: F12NR4, F15NR5, F18NR6, and F12NR6. The interior
diameters of the nanorings were calculated on the basis of the radial water densities obtained by MD
simulation of the water permeation through the nanochannel (fig. S37). The wall position was defined as
the point at which the water density was 1% of its maximum value. (B) Schematic representation of
the supramolecularly polymerized FmNRn into a fluorous nanochannel (FmNCn) embedded in a vesicular
phospholipid bilayer membrane. (C) Schematic representation of the flow of water clusters in hydrophilic
and hydrophobic nanochannels. (D) Dangling bond distribution of water molecules in a virtual Lennard-Jones
(LJ) channel with a diameter of 1.76 nm. The hydrophobicity of the LJ channel was controlled by the scaling
factor (10). (E) Flow rates of water molecules in the LJ channel at different hydrophobicity levels.


S23, S25, and S27). These results indicate that the conformational aspect of
FmNRns is determined by an H-bonding circular array composed of intramolecular
"C-F···H-N" bonds. In this context, when we heated a deuterated dimethyl
sulfoxide (DMSO-d6) solution of FmNRn (2.0 mM) from 25° to 65°C, all the signals
pertaining to Ar-F in the 19F NMR spectra exhibited a maximum upfield shift of
0.5 parts per million (ppm) (fig. S32, A to D), which is nearly one order of
magnitude smaller than those reported for linear analogs of similar

Fig. 2. Formation of fluorous nanochannels. (A to D) CPK representations of (A) F12NR4,
(B) F15NR5, (C) F18NR6, and (D) F12NR6 nanorings obtained by DFT calculations at the
B3LYP/6-31G* level of theory (figs. S20, S22, S24, and S26, respectively). (E to H)
Electrostatic potential maps of nanorings (E) F12NR4, (F) F15NR5, (G) F18NR6, and
(H) F12NR6 obtained by DFT calculations at the B3LYP/6-31G* level of theory (figs. S20,
S22, S24, and S26, respectively). The electrostatic potentials were mapped on the surface
with an electron density of 0.0004 units. The red and blue surfaces represent negative and
positive regions of the potential, respectively. (I to L) CPK representations of
(I) F12NC4, (J) F15NC5, (K) F18NC6, and (L) F12NC6 nanotubes formed by supramolecular
polymerization of F12NR4, F15NR5, F18NR6, and F12NR6, respectively. Each model was
constructed on the basis of the structure of FmNRn obtained through DFT calculations at
the B3LYP/6-31G* level of theory (figs. S20, S22, S24, and S26). The linear alkyl chains
were separately optimized and combined with DFT-optimized FmNRn. (M to P) TEM micrograph
of an air-dried dispersion of supramolecularly polymerized (M) F12NR4, (N) F15NR5,
(O) F18NR6, and (P) F12NR6, prepared by heating the suspensions in DMSO/pentadecane at
50°C for 5 min followed by cooling to room temperature. The sample was stained with
uranyl acetate.


structures (2). Even when the sample solutions were diluted by a factor of 1000
with DMSO-d6 from 2.0 mM to 2.0 µM at 25°C, no appreciable spectral change was
observed (fig. S32, E to H). The observed NMR spectral features were consistent
with those obtained computationally. To confirm that the fluorous nanorings
F12NR4, F15NR5, F18NR6, and F12NR6 formed nanochannels, their DMSO solutions
were added to pentadecane and the mixtures were annealed at 50°C for 5 min and
subjected to transmission electron microscopy (TEM) after being

740 13 MAY 2022 + VOL 376 ISSUE 6594 



allowed to cool to room temperature (10). As shown in Fig. 2, M to P, TEM
imaging could successfully visualize the presence of extremely long nanotubes
with uniform diameters of 5.1, 6.0, 6.4, and 6.4 nm for F12NR4, F15NR5, F18NR6,
and F12NR6, respectively, consistent with those estimated from a molecular
modeling study (10) (Fig. 2, I to L). These results indicated that FmNRns
(Fig. 1A) can form fluorous nanochannels FmNCns in the vesicular phospholipid
bilayer membranes (Figs. 1B and 2, I to L) employed for the permeation study.



We performed molecular dynamics simulations of water permeation using the
molecular structures of FmNCns. The results highlighted that FmNCns can permeate
water molecules considerably faster than their nonfluorous versions (figs. S35
and S36). To evaluate the water permeability, we initially embedded FmNCns in
1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC) vesicles and used a stopped-flow
method in conjunction with light scattering (figs. S38 and S39) to analyze the
corresponding water permeation profiles.

Fig. 3. Stopped-flow fluorescence measurements to evaluate the water permeation kinetics through
fluorous nanochannels. (A) Schematic representation of the stopped-flow fluorescence measurement
setup. A HEPES buffer solution of NaCl ([HEPES] = 10 mM, [NaCl] = 500 mM) was added to a HEPES
buffer dispersion of CF-enclosing DPPC vesicles ([HEPES] = 10 mM, [NaCl] = 100 mM). Osmotic
pressure was generated in the vesicular phospholipid bilayer membrane because the NaCl
concentrations in the exterior (300 mM) and interior (100 mM) environments of the vesicle were
different. Subsequently, the vesicles were expected to release water molecules through the
channels and shrink. This volume shrinkage event was monitored through the fluorescence quenching
of CF, owing to its increased concentration in the interior environment of the vesicles.
(B) Representative stopped-flow fluorescence decay profiles of CF enclosed in DPPC vesicles at
25°C in the presence (red, x = 0.0018) and absence (black) of F12NR4 in the vesicular phospholipid
bilayer membrane. Eight decay profiles were averaged. (C to F) Dependence of the osmotic water
permeability coefficients Pf of the vesicles at 25°C on the percent mole fractions (x) of
(C) F12NR4, (D) F15NR5, (E) F18NR6, and (F) F12NR6 in DPPC. The error bars represent ±1 SD from
the averaged values calculated from 3 to 10 independent experiments.


Although this method has occasionally been used to analyze the water permeation
profiles of CNTs (12) and AQPs (13), we observed a large deviation in the
permeation profiles (fig. S38, C to F). Because this deviation was likely caused
by spontaneous water leakage through the fluidic vesicular membrane, we used a
1,2-dipalmitoyl-sn-glycero-3-phosphocholine (DPPC) vesicle instead. Over a wide
temperature range, including ambient temperatures, this lipid has been reported
to form a less fluidic gel-phase bilayer membrane with a thickness of 3.7 nm
(14, 15), which barely permeates water at ambient temperatures (16). As observed
by the membrane fluidity test based on the laurdan assay (17) (fig. S40), the
channel-forming FmNRns did not deteriorate the structural integrity of the
bilayer membrane of DPPC.
To enhance the signal-to-noise ratio, we also changed the detection method from
light scattering to fluorescence emission (18), which has been used to evaluate
the water permeability of AQPs (19), nanoring-assembled synthetic nanochannels
(20), and CNTs (21), for example.

As a typical example, a HEPES buffer solution of NaCl ([HEPES] = 10 mM,
[NaCl] = 500 mM) was added to a HEPES buffer dispersion of F12NR4-embedded DPPC
vesicles ([HEPES] = 10 mM, [NaCl] = 100 mM, [DPPC] = 2.9 mM,
[F12NR4] = 0.079 µM). The resulting salt concentrations in the exterior
([NaCl] = 300 mM) and interior ([NaCl] = 100 mM) of the DPPC vesicular membrane
were different from each other (Fig. 3A, ii); therefore, osmotic pressure was
generated in the membrane, causing the DPPC vesicle to shrink to one-third of
its original volume through outward water permeation. We wondered whether
osmotic pressure can cause inward NaCl permeation to release pressure. Through a
separate set of experiments, we confirmed that the inward NaCl permeation was
too slow to affect the evaluation of the outward water permeation (Fig. 4C and
fig. S41). With reference to existing methods (19-21), we employed
carboxyfluorescein (CF; Fig. 3A) as an enclosed fluorescence probe for
investigating water permeation through fluorous F12NC4 because the fluorescence
intensity of CF is concentration-dependent and decreases as a result of
self-quenching upon volume shrinkage (fig. S42A). When the phospholipid bilayer
membrane at 25°C contained channel-forming F12NR4 at a percent mole fraction x
of 0.0018 (10), the

Fig. 4. Water and ion permeabilities through fluorous nanochannels in DPPC vesicles.
(A) Single-channel osmotic water permeabilities pf of F12NC4, F15NC5, F18NC6, and F12NC6, along
with those of CNT (21) and AQP1 (19). (B) Single-channel fluxes (f) of water permeation through
F12NC4, F15NC5, F18NC6, and F12NC6, obtained by dividing pf by their water-permeable
cross-sectional areas. The values of CNT (21) and AQP1 (19) are also shown. (C) Reflection
coefficients (values in the bar chart) and single-channel Cl⁻ permeabilities pCl (below the axis
of the bar chart) of F12NC4, F15NC5, F18NC6, and F12NC6. ND indicates that the value was under
the detection limit. (D) Plots of intrinsic water/salt selectivity (Pw/Ps) versus water
permeability Pw (cm s⁻¹). Pw/Ps and Pw values for the reconstituted membranes of FmNCn, CNT, and
AQP1 in a DOPC matrix were calculated with a reported method (27). The a values in the plots
indicate fractional areas of the membranes occupied by the channels. The gray spots are the
values for polymeric desalination membranes (10). The broken and solid lines in gray represent
the trade-off and upper-bound lines, respectively, for the performance of polymeric desalination
membranes (10).


time-dependent fluorescence intensity change at >500 nm indicated that the CF
fluorescence ([CF]0 = 0.5 mM) was quenched within 10 ms (Fig. 3B, red). By
contrast, no fluorescence quenching occurred in the absence of channel-forming
F12NR4 (Fig. 3B, black), as expected from the lack of water permeability of DPPC
vesicles reported previously (16). The quenching profile shown in Fig. 3B (red)
allowed us to evaluate the kinetic profile of the vesicle shrinkage calibrated
by the method shown in fig. S42B and calculate the osmotic water permeability
coefficient Pf to be 0.22 cm s⁻¹ at x = 0.0018 (10). As shown in Fig. 3C, the Pf
values increased linearly as x increased from 0 to 0.0027, indicating that the
vesicle shrinkage rate in the experimental x range is governed by the amount of
water permeated through the channel. Similarly, the Pf values of other fluorous
FmNCns were obtained as shown in Fig. 3, D to F. To validate that water
permeation occurs through the fluorous nanochannel FmNCns, we conducted a set of
water permeation experiments using F15NR5 in the presence of poly(ethylene
glycols) (PEGs) with hydrodynamic diameters (Dh) ranging from 0.4 to 3.2 nm
(22). As typically observed for interference in water permeation by clogging
with polymers, Pf decreased considerably when the Dh of PEG ranged from 1.1 to
1.8 nm, similar to the interior diameter of F15NR5 (1.46 nm) (fig. S43) (10, 23).
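
The osmotic setup described above can be checked with a short calculation. The sketch below assumes equal-volume mixing of the two buffers (implied by 500 mM and 100 mM giving 300 mM outside) and an ideal osmometer in which the enclosed NaCl cannot escape; the function names are illustrative and do not come from the paper.

```python
def mixed_concentration(c_a_mM, c_b_mM, vol_a=1.0, vol_b=1.0):
    """Concentration after mixing two solutions, assuming additive volumes."""
    return (c_a_mM * vol_a + c_b_mM * vol_b) / (vol_a + vol_b)


def equilibrium_volume_fraction(c_in_mM, c_out_mM):
    """Final/initial vesicle volume for an ideal osmometer.

    The enclosed solute is conserved, so c_in * V0 = c_out * V_final once the
    interior concentration has risen to match the exterior one.
    """
    return c_in_mM / c_out_mM


# Exterior NaCl after adding the 500 mM buffer to the 100 mM dispersion (1:1, assumed).
c_out = mixed_concentration(500.0, 100.0)
# Interior NaCl starts at 100 mM; water leaves until both sides match.
fraction = equilibrium_volume_fraction(100.0, c_out)
print(f"exterior = {c_out:.0f} mM, final volume fraction = {fraction:.3f}")
```

Under these assumptions the vesicle ends at one-third of its starting volume, in line with the shrinkage stated in the text.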

The number of F12NR4 molecules per channel (N) was suggested to be nearly 10 on
the basis of the DFT-optimized stacking distance of F12NR4 (0.41 nm; fig. S28)
and membrane thickness of a DPPC vesicle (3.7 nm). The single-channel
conductance calculated from this value was consistent with that observed using a
planar bilayer membrane under the application of a voltage ranging from 50 to
175 mV without any salt concentration gradient (fig. S45A). Because Pf
represents the water permeability per unit area of a phospholipid bilayer
membrane, we calculated the single-channel water permeability pf by using Pf, N,
x, and the interior and exterior diameters of the vesicle (Fig. 4A).
Unexpectedly, the calculated pf was 5.5 × 10⁻¹⁰ cm³ s⁻¹ per channel, larger than
the reported pf values for the AQP family (AQP1: pf = 1.2 × 10⁻¹³ cm³ s⁻¹ per
channel at 37°C) (19), CNT (pf = 2.3 × 10⁻¹³ cm³ s⁻¹ per channel at room
temperature) (21), and other synthetic water-permeable nanochannels previously
reported (20, 24-31) (table S1). Similarly, we evaluated the pf values of
F15NC5, F18NC6, and F12NC6 (Fig. 4A) and noted that these nanochannels exhibited
a high water permeability, of the same order of magnitude as that of F12NC4.

To critically compare the water permeabilities of nanochannels with different
channel diameters, we considered the corresponding flux values f
(cm³ s⁻¹ nm⁻²), which are normalized permeabilities, taking into account the
pore size [i.e., the water-permeable cross-sectional area (A), determined using
MD simulations (Fig. 1A)] (10). The results summarized in Fig. 4B were notable
because the thinnest nanochannel, F12NC4 (A = 0.64 nm²), exhibited the largest f
value (8.7 × 10⁻¹⁰ cm³ s⁻¹ nm⁻²). The second-thinnest nanochannel, F15NC5,
exhibited a slightly larger f (A = 1.67 nm², 1.9 × 10⁻¹⁰ cm³ s⁻¹ nm⁻²) than
those of other nanochannels F18NC6 (A = 2.43 nm², 1.2 × 10⁻¹⁰ cm³ s⁻¹ nm⁻²) and
F12NC6 (A = 2.83 nm², 1.0 × 10⁻¹⁰ cm³ s⁻¹ nm⁻²); however, this value was
one-fifth that of F12NC4. The conventional understanding of pipe flow indicates
that a thinner channel provides a smaller flux because the shear stress on the
interior channel surface is more dominant. However, the f values of F12NC4 to
F18NC6, as well as those of partially fluorinated F12NC6, indicated the opposite
tendency. This anomalous tendency in water flow through nanochannels has been
predicted when the water-permeable cross-sectional area of the nanochannel is
<0.2 nm² (32). Computational simulations suggest that water molecules in such
confined nanospaces are in alignment and that there is an associated decrease in
shear stress on the channel surface. In this context, it is of interest to
examine the reasons for this trend even though the cross-sectional areas of our
nanochannels are >0.64 nm². As shown
in Fig. 1, C to E, fluorous nanochannels with a PTFE-like interior surface can
break water clusters (fig. S46) (5, 6). This effect likely becomes more dominant
as the diameter of the fluorous nanochannel decreases, leading to the
enhancement of water permeation through the nanochannel. The cluster-breaking
function is also essential for AQPs and CNTs to permeate water because their
interior diameters [~0.3 nm (33) and ~0.4 nm (22, 20), respectively] are smaller
than those of the water clusters (~0.8 nm) (34). Notably, polar amino acid
residues and carboxylic acid groups at the entrance parts of AQPs and
oxidatively cut CNTs, respectively, have been considered to promote the breakage
of water clusters. By contrast, the proposed fluorous nanochannels do not need
to carry cluster-breaking functional groups at the entrance parts because their
interior diameters are larger than those of the water clusters. Instead, the
cluster-breaking nature of their interior fluorous surfaces enhances the water
permeability, as computationally predicted in the results shown in Fig. 1, D and
E, and fig. S46.
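
The flux normalization discussed above is simply f = pf / A. As a rough consistency check, the sketch below inverts that relation using the A and f values quoted in the text. The channel labels and the 10⁻¹⁰ exponents are readings of damaged superscripts in the extracted text and should be treated as assumptions.

```python
# A in nm^2 and f in cm^3 s^-1 nm^-2, as read from the text (exponents assumed).
channels = {
    "F12NC4": {"A": 0.64, "f": 8.7e-10},
    "F15NC5": {"A": 1.67, "f": 1.9e-10},
    "F18NC6": {"A": 2.43, "f": 1.2e-10},
    "F12NC6": {"A": 2.83, "f": 1.0e-10},
}


def single_channel_permeability(flux, area_nm2):
    """Invert f = pf / A to recover pf in cm^3 s^-1 per channel."""
    return flux * area_nm2


pf = {name: single_channel_permeability(v["f"], v["A"]) for name, v in channels.items()}

# Flux of the second-thinnest channel relative to the thinnest one.
flux_ratio = channels["F15NC5"]["f"] / channels["F12NC4"]["f"]

# Flux should fall monotonically as the cross-sectional area grows.
by_area = sorted(channels.values(), key=lambda v: v["A"])
flux_decreases_with_area = all(a["f"] > b["f"] for a, b in zip(by_area, by_area[1:]))

for name, value in pf.items():
    print(f"{name}: pf = {value:.2e} cm^3/s per channel")
print(f"F15NC5/F12NC4 flux ratio = {flux_ratio:.2f}")
```

With these inputs the recovered pf for the thinnest channel lands close to the 5.5 × 10⁻¹⁰ cm³ s⁻¹ reading, all four pf values stay within one order of magnitude, and the flux ratio is roughly one-fifth.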

A key concern in developing water-permeable nanochannels is salt reflectance,
which is important in desalination applications. Although AQPs can realize
perfect desalination (33), CNTs, the best-performing synthetic nanochannels for
fast water permeation, do not effectively reflect salts (21). Therefore, we
first evaluated the salt reflectance properties of FmNCns based on the
reflection coefficients (RCs) for NaCl. RC is the ratio of the water
permeability with NaCl (Pf,NaCl) as an osmolyte to that associated with sucrose
(Pf,sucrose) as a nonpermeable osmolyte (fig. S47), i.e., Pf,NaCl/Pf,sucrose
(10). RC is unity when the channel does not permeate NaCl (13) and smaller than
unity when the channel does not reflect NaCl (28). The calculated RC for the
thinnest F12NC4 (A = 0.64 nm²) was 1.02, indicating a substantially perfect
reflection of NaCl (Fig. 4C). However, as the water-permeable cross-sectional
area A increased from F12NC4 to F15NC5 (A = 1.67 nm²) and F18NC6
(A = 2.43 nm²), RC decreased to 0.93 and 0.79, respectively, indicating a
negative correlation between A and the ability to reflect NaCl. For partially
fluorinated F12NC6 (A = 2.83 nm²), RC was 0.91, similar to that of F15NC5.
Overall, all the nanochannels except F18NC6 exhibited excellent salt reflection
profiles. We conducted single-channel Cl⁻ permeability (pCl, cm³ s⁻¹ per
channel) measurements based on a well-established stopped-flow fluorescence
technique using channel-embedded DPPC vesicles that enclosed lucigenin, a
Cl⁻-sensitive dye (21, 27, 28). Cl⁻ permeability can be approximated to the
permeability of salt. In general, charge neutrality is required in the
permeation of salt. Hence, if its cation or anion part does not permeate, the
salt is considered to
be reflected (28). Although the pCl values of
F12NC4 and F15NC5 were smaller than the detection
limit, F18NC6 exhibited Cl⁻-induced
fluorescence decay characteristics correspond- 
ing to bq = 9.7 x 10° cm? s” per channel. 
Although F12NC6 also showed the permeation
of CI, the value of bg (3.8 x 10°-?° em? s™ per 
channel) was one order of magnitude smaller 
than that of F18NC6 (Fig. 4C and fig. S48). Meanwhile, we evaluated the Cl⁻
permeabilities of FmNCns using the DOPC vesicles and obtained results that were
consistent with those obtained using the DPPC vesicles (fig. S49). Considering
the salt reflection (RC) and Cl⁻ permeation experiments, the thinnest F12NC4
exhibited a nearly perfect reflectance, even though its interior diameter
(0.9 nm) was larger than the hydration diameter of Cl⁻ (0.66 nm) (35). We
calculated the electrostatic potential maps of the four nanorings using the
B3LYP/6-31G* level of theory. As shown in Fig. 2, E to G, nanorings F12NR4,
F15NR5, and F18NR6 were electrostatically negative at the interior surfaces
because of the densely arranged polarized C-F bonds. Thus, the thinnest F12NC4
exhibited a highly powerful electrostatic barrier for the incorporation of Cl⁻.
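
The reflection coefficient defined above is a plain ratio, and the reported trend can be tabulated directly. In the sketch below the function name is illustrative, the example permeabilities passed to it are hypothetical, and the channel labels attached to each (A, RC) pair are a reading of the damaged compound names in the extracted text.

```python
def reflection_coefficient(pf_nacl, pf_sucrose):
    """RC = Pf,NaCl / Pf,sucrose; about 1 means NaCl is fully reflected."""
    if pf_sucrose <= 0:
        raise ValueError("Pf,sucrose must be positive")
    return pf_nacl / pf_sucrose


# (label, A in nm^2, RC) as quoted in the text; labels are assumed readings.
reported = [
    ("F12NC4", 0.64, 1.02),
    ("F15NC5", 1.67, 0.93),
    ("F18NC6", 2.43, 0.79),
    ("F12NC6", 2.83, 0.91),  # partially fluorinated ring
]

# Among the three fully fluorinated channels, RC should fall as A grows.
fully_fluorinated = reported[:3]
rcs = [rc for _, _, rc in fully_fluorinated]
rc_falls_with_area = all(a > b for a, b in zip(rcs, rcs[1:]))

# Hypothetical permeabilities, only to show how the ratio behaves.
example_rc = reflection_coefficient(pf_nacl=0.18, pf_sucrose=0.20)
print(f"example RC = {example_rc:.2f}, trend holds = {rc_falls_with_area}")
```

The partially fluorinated channel is kept out of the trend check because the text treats it separately: it has the largest A but an RC close to that of the second-thinnest channel.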

We compared the desalination performances of the proposed fluorous nanochannels
with those of reported examples by plotting the water permeabilities (Pw) versus
the water/salt selectivities (Pw/Ps) (10). As shown in Fig. 4D, the overall
desalination performance is higher if the plot of Pw/Ps versus Pw is located
more in the top right corner. The Pw/Ps to Pw plots of F12NC4, F15NC5, F18NC6,
and F12NC6 are all located in the top right corner of the graph.

We demonstrated that a fluorous nano- 
channel with a PTFE-like interior surface 
and an appropriate channel diameter can 
permeate water at an unprecedentedly high 
rate with a nearly perfect desalination. These 
properties originate from the electrostatically 
negative fluorous interior surface, which can 
break water clusters to enhance the water 
permeability and can also provide a powerful
electrostatic barrier for the incorporation of Cl⁻.

WETLAND ECOLOGY

High-resolution mapping of losses and gains of Earth's tidal wetlands

Nicholas J. Murray, Thomas A. Worthington, Pete Bunting, Stephanie Duce, Valerie Hagger,
Catherine E. Lovelock, Richard Lucas, Megan I. Saunders, Marcus Sheaves, Mark Spalding,
Nathan J. Waltham, Mitchell B. Lyons


Tidal wetlands are expected to respond dynamically to global environmental change, but the extent to
which wetland losses have been offset by gains remains poorly understood. We developed a global
analysis of satellite data to simultaneously monitor change in three highly interconnected intertidal
ecosystem types (tidal flats, tidal marshes, and mangroves) from 1999 to 2019. Globally, 13,700 square
kilometers of tidal wetlands have been lost, but these have been substantially offset by gains of
9700 km², leading to a net change of -4000 km² over two decades. We found that 27% of these
losses and gains were associated with direct human activities such as conversion to agriculture and
restoration of lost wetlands. All other changes were attributed to indirect drivers, including the effects
of coastal processes and climate change.


Tidal wetlands are of immense importance to humanity, providing benefits such as
carbon storage and sequestration, coastal protection, and fisheries enhancement
(1, 2). Unfortunately, intensification of anthropogenic pressure and the growing
impacts of climate change are affecting tidal wetlands and their component
intertidal ecosystems in pervasive ways. Losses of tidal wetlands are widely
reported (3-5), although at local scales intertidal ecosystems are known to have
the capacity to respond to environmental change, gaining extent by means of
sediment accumulation, inland migration, and redistribution (6-9).
Redistribution and recovery through natural processes are increasingly
supplemented by broad-scale ecosystem restoration activities (10). Although a
number of studies have suggested that intertidal ecosystems are highly resilient
to environmental change (11, 12), little is known about the degree to which
gains in tidal wetland extent have counterbalanced known losses. Previous
analyses have been unable to address this question due to factors such as focus
on mapping single ecosystem types (9, 13, 14) (which cannot distinguish losses
from transitions among adjacent intertidal ecosystems), a lack of consistent
data on the global extent and change of tidal marshes (15), and uncertainty
about the prevailing drivers of tidal wetland change. This has led to
considerable uncertainty about how tidal wetlands have changed in recent decades
and how they are expected to persist in the future (11, 12).

We report an integrated, globally consistent 
analysis of the distribution and change of 
Earth’s three intertidal ecosystems: tidal flats, 
tidal marshes, and mangroves (hereafter refer- 
red to collectively as “tidal wetlands”; fig. S1). 
Where they co-occur, these three ecosystems 
are highly interconnected, with feedback mech- 
anisms among biological and physical com- 
ponents that interact extensively across the 
systems (8). We investigate the spatiotemporal 
distribution of tidal wetlands globally by ap- 
plying stacked machine learning classifiers to 
remotely sensed data to model their occur- 
rence, detect the type and timing of loss and 
gain events, and assess the drivers of change 
over the study period (1999 to 2019). The 
validated dataset was produced by combining 
observations from 1,166,385 satellite images 
acquired by the Landsat 5 to 8 missions with 
environmental data of variables known to 
influence the distributions of each ecosys- 
tem type, including temperature, slope, and 
elevation (tables S1 and S2). Tidal wetland 
loss was defined as the replacement of any of 
the three focal ecosystems with nonintertidal 
ecosystems at the 30-m pixel scale, with tidal 
wetland gain defined as their establishment 
in pixels where they did not occur in 1999. A 
weighted random sample of detected changes 
was used to estimate the contribution of 
direct human impacts versus indirect drivers, 
such as sea level rise and natural coastal 
processes, on tidal wetland losses and gains 
globally (16).

Our global dataset reveals that the total observed area of tidal wetlands in
2019 was at least 354,600 km² [95% confidence interval (CI): 244,800 to
363,900 km²]. Tidal wetlands are unevenly distributed across the world's
coastlines, with the largest remaining contiguous tracts occurring as deltaic
mangroves fringed by extensive tidal flats in the Amazon Delta, the Northern Bay
of Bengal, New Guinea, and the Niger Delta (Fig. 1A). Previous estimates of
global tidal marsh extent rely on spatial data compilations with large gaps in
coverage that lead to underestimates of extent (5), limiting their use for
estimating global blue carbon stocks (17). Our data therefore allow a first
empirical estimate of global tidal marsh extent of 90,800 km², obtained by
subtracting previously derived extent estimates of mangroves (135,900 km²) and
tidal flats (127,900 km²) from our global tidal wetland area estimate (9, 18).
Our estimate of tidal marsh extent represents 25.6% of the total tidal wetland
extent mapped in this study and is 65.1% greater than a previously reported
minimum global estimate of 55,000 km² (15). Because our methods are limited in
regions higher than 60°N latitude, where tidal marshes and tidal flats are known
to occur, this upward revision of global tidal marsh extent should be considered
conservative.
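
The tidal marsh estimate above is derived by subtraction, so it can be reproduced from the figures quoted in the paragraph. The following sketch uses only those numbers; the variable names are illustrative.

```python
# Extents in km^2, as quoted in the text.
total_tidal_wetland_km2 = 354_600
mangrove_km2 = 135_900
tidal_flat_km2 = 127_900
previous_marsh_minimum_km2 = 55_000

# Tidal marsh extent is what remains after removing mangroves and tidal flats.
tidal_marsh_km2 = total_tidal_wetland_km2 - mangrove_km2 - tidal_flat_km2

# Share of the mapped total, and increase over the earlier minimum estimate.
marsh_share_pct = 100 * tidal_marsh_km2 / total_tidal_wetland_km2
increase_pct = 100 * (tidal_marsh_km2 / previous_marsh_minimum_km2 - 1)

print(f"tidal marsh extent: {tidal_marsh_km2:,} km^2")
print(f"share of total: {marsh_share_pct:.1f}%")
print(f"increase over previous minimum: {increase_pct:.1f}%")
```

The three printed values correspond to the 90,800 km², 25.6%, and 65.1% figures in the paragraph.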

We analyzed tidal wetland change over the 20-year study period and found that
losses of 13,700 km² (95% CI: -16,800 to -8200 km²) have been substantially
offset by the establishment of 9700 km² (95% CI: +4900 to +15,700 km²) of new
tidal wetlands that were not present in 1999 (Table 1). Despite wide geographic
variation in the occurrence of tidal wetlands globally, many regions showed a
consistent pattern of losses being substantially offset by nearby gains (Fig. 1,
B and C). This pattern was most pronounced in the world's major river deltas
(19), where about one-fifth (19.1%) of the area of tidal wetland changes
occurred, despite containing only 7.5% of the world's tidal wetland extent
(Fig. 2A). We recorded the greatest tidal wetland change in the
Ganges-Brahmaputra (1070 km²) and Amazon deltas (730 km²), both of which have
exhibited increased tidal wetland extent since 1999 (ratio of loss to gain 0.92
and 0.98, respectively). Many deltas have experienced a net increase in total
extent over the past three decades as a result of increases in fluvial sediment
supply caused by catchment deforestation and increased upland soil erosion (19).
Our data, however, suggest a net loss of tidal wetlands on deltas globally,
though gains of 2100 km² alongside losses of 2300 km² indicate the considerable
dynamism of these systems. The latter have been associated with multiple direct
drivers of change, such as conversion to agriculture and aquaculture (9, 14),
urban expansion (20), and geomorphic changes due to dikes and channel diversions
(20), together with many indirect drivers, including shoreline erosion (9, 20),
compaction, subsidence and sea level rise (21), storm-driven
vegetation loss (22), pollution (23), and altered sediment supply (19).

Fig. 1. The global distribution of major tidal wetlands and their changes
from 1999 to 2019. (A) The 2019 distribution of tidal wetlands is
modeled as the combined distribution of the world's three main intertidal
ecosystems: tidal flats, tidal marshes, and mangroves. Darker colors
indicate greater area of tidal wetlands per 2° grid cell. (B) Losses and
(C) gains over the period 1999 to 2019. Circle sizes indicate the extent of
tidal wetland loss and gain over the study period in square kilometers
per 2° grid cell.

Of the three intertidal ecosystems included in our analysis, tidal flats
experienced both the greatest loss (7000 km²; 95% CI: 4200 to 8600 km²) and gain
(6700 km²; 95% CI: 3400 to 10,800 km²), accounting for almost two-thirds (58.8%)
of the total tidal wetland area change (Table 1 and Fig. 3A). A ratio of loss to
gain of 1.1 indicates that newly established tidal flats have made a substantial
contribution to offsetting the magnitude of their net loss globally. By
contrast, mangroves had the highest ratio of loss to gain (3.0), with an
estimated net decrease in extent of 3700 km² (95% CI: -5400 to -2100 km²),
indicating that extensive mangrove losses have only been partially offset by the
1800 km² (95% CI: +900 to +3000 km²) of new mangroves detected by our analysis.
Tidal marshes had the lowest total area change and were the only ecosystem to
have a loss to gain ratio of <1, indicating that their gain marginally exceeded
their loss for an estimated net increase of 100 km² (95% CI: 0 to +100 km²). Our
estimates of global tidal wetland change partitioned by


ecosystem type agree in magnitude with recently published estimates of changes
in mangrove extent from 2000 to 2016 of -3400 km² (14). No comparable global
estimates of tidal flats and tidal marsh change are available because of
differing coverage of change analyses of tidal flats (9) and a lack of any data
sufficient to support global change analyses of tidal marshes (5).
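
The net-change and loss-to-gain figures reported above follow from simple arithmetic on losses and gains. The sketch below is a minimal illustration using the rounded global and tidal-flat values quoted in the text; the `Change` class is hypothetical. Because the inputs are rounded, derived ratios and shares can differ slightly from the reported ones, which were presumably computed from unrounded areas.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Change:
    """Loss and gain of one ecosystem class, in km^2 (rounded values from the text)."""
    loss_km2: float
    gain_km2: float

    @property
    def net_km2(self):
        return self.gain_km2 - self.loss_km2

    @property
    def loss_to_gain(self):
        return self.loss_km2 / self.gain_km2

    @property
    def total_change_km2(self):
        return self.loss_km2 + self.gain_km2


all_wetlands = Change(loss_km2=13_700, gain_km2=9_700)
tidal_flats = Change(loss_km2=7_000, gain_km2=6_700)

# Share of all detected change (loss plus gain) that occurred on tidal flats.
flat_share_pct = 100 * tidal_flats.total_change_km2 / all_wetlands.total_change_km2

print(f"global net change: {all_wetlands.net_km2:+,.0f} km^2")
print(f"global loss:gain ratio: {all_wetlands.loss_to_gain:.2f}")
print(f"tidal flat share of change: {flat_share_pct:.1f}%")
```

The global net change reproduces the -4000 km² headline figure. The tidal-flat share comes out a little below the reported 58.8%, which is the kind of gap expected from working with rounded inputs.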

About three-quarters of the 4000 km² net global tidal wetland decrease between
1999 and 2019 occurred in Asia (74.1%), with 68.6% concentrated in just three
countries: Indonesia (36%), China (20.6%), and Myanmar (12%). Losses of tropical
mangrove forests across Southeastern Asia, particularly Indonesia and Myanmar,
are well-documented (14, 18), and the extensive impact of coastal land
conversion was confirmed by our analysis (Fig. 3B). In China, there was also a
large net reduction of tidal flat extent of >1000 km² largely due to reclamation
(24) but net gains of tidal marshes (+200 km²) that coincided with the rapid
expansion of Spartina alterniflora across the country's intertidal zone
(Fig. 3C) (25), thus reducing China's net tidal wetland loss to 800 km²
(table S3). Outside of Asia, tidal wetlands in Africa had the highest ratio of
loss to gain (1.6), indicating a strong loss dynamic that has been associated
with severe mangrove degradation, which is most intense in Nigeria, Mozambique,
and Guinea-Bissau (Fig. 1B and table S3).


Interpretation of a globally distributed random sample of tidal wetland losses
and gains suggested that 39% of losses and 14% of gains were caused by direct
human activities (table S9). Direct human activities were defined as observable
activities occurring at the location of the detected change (26), including
conversion to aquaculture, agriculture, plantations, coastal developments, and
construction of physical structures such as seawalls and dikes (9) (fig. S8).
They also include drivers of gain such as mangrove planting, restoration
activities, or coastal modifications to promote tidal exchange (Fig. 2B and
fig. S8).

Table 1. Tidal wetland change estimates by intertidal ecosystem type for different regions of the world from 1999 to 2019. Change estimates are in
square kilometers. Tidal wetlands in this study collectively refer to tidal flat, tidal marsh, and mangrove ecosystems, such that area change of tidal
wetlands is the sum of the change area of the three component intertidal ecosystems. Per-pixel losses and gains were summarized at these regional
scales, with 95% confidence intervals derived from quantitative accuracy assessment in parentheses. Analysis units are realms from the Marine

At the continental scale, Asia was identified as the global center of tidal
wetland loss from direct human activities (fig. S9). In Asia, direct drivers
accounted for more than two-thirds of the losses of each ecosystem (mangrove,
75%; tidal marsh, 69%; tidal flats, 62%; table S10), confirming the negative
effects of widespread coastal transformation on coastal ecosystems. Although the
impact of coastal development on mangroves and tidal wetlands has been
previously reported (9, 14), our results reveal that Asian tidal marshes have
similarly been severely degraded by human activities. Compared with Asia, direct
human activities had a much lesser role in the losses of tidal wetlands in
Europe (28%), Africa (27%), North America (9%), South America (2%), and Oceania
(0%; fig. S9).

Indirect or ex situ drivers include both 
natural coastal processes and those influenced 
by human activities remotely from the location 
of observed change. They include processes of 
isostatic change (27), sea level rise (8), storm 
impacts (22), erosion and progradation (22), 
along-shore coastal development (9), and the 
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combined effects of all of the above. More than 
90% of tidal wetland losses in North America 
(91%), South America (98%), and Oceania 
(100%) were attributed to indirect drivers (fig. 
S9). Globally, indirect drivers accounted for 
most losses of tidal marshes (78%) and tidal 
flats (66%), whereas mangrove losses were
split equally between direct and indirect
drivers (50% each; table S9).

Most tidal wetland gains (86%) were the 
result of indirect drivers, highlighting the 
prominent role that broad-scale coastal pro- 
cesses have in controlling tidal wetland extent 
and facilitating natural regeneration. However, 
disentangling the specific processes underpin- 
ning tidal wetland change is challenging with 
analyses conducted at large spatial scales. In 
most cases direct drivers could be clearly 
identified, but many indirect drivers operate 
over large spatial and temporal scales and 
may originate tens to thousands of kilometers 
from an observed tidal wetland change. Change 
in ecosystem extent can also be the result of 
more than one indirect driver or of interac- 
tions between drivers. Our work therefore 
suggests a need for continued monitoring, 
experiments, and models that can account for 
these complexities to help characterize and 
predict global tidal wetland dynamics. 




There is potential to use our analysis, which 
is designed to be periodically updated, to track 
larger-scale coastal ecosystem restoration acti- 
vities. Although there has been a surge in 
coastal restoration efforts worldwide (27), 
many of these have been unsuccessful (10). 
Monitoring progress of restoration remotely, 
independently, and at broad scales could con- 
tribute to reporting on international con- 
servation initiatives such as the UN Decade on 
Ecosystem Restoration, on targets associated 
with the Convention on Biological Diversity, and 
on mitigation commitments made under the 
UN Framework Convention on Climate Change 
(27). The driver analysis indicated that 14% of 
observed tidal wetland gains were attributable 
to direct human interventions (table S9). These 
activities were most apparent for tidal marshes 
and were typically the product of site-scale
restoration activities (Fig. 2B and fig. S8). 

Our analysis enables the detection and char- 
acterization of dynamic transitions among 
intertidal ecosystem types globally. Transi- 
tions have been linked to a number of physical 
and climatic factors such as sea level rise, geo- 
morphic changes, and variation in temper- 
ature and rainfall (28, 29). We found that 1.9% 
of the world’s tidal wetlands exhibited tran- 
sitions among ecosystem types over the study 


Fig. 2. Representative examples of 1999 to 
2019 tidal wetland loss and gain. (A) Losses 
and gains of tidal flats and tidal marshes after 
diversion of the main channel in 1996 in the Yellow 
River delta in China (left). Reference images 
acquired in 1998 (middle) and 2020 (right). 

(B) Tidal marsh gain due to Europe's largest 
coastal wetland restoration project, UK (left). 
Reference images acquired in 1999 (middle) 

and 2018 (right). (C) Mangrove loss due to
tectonic subsidence after the Aceh-Andaman
earthquake of 2004, Katchal Island, Nicobar
Islands (left). Reference images acquired in 1992
(middle) and 2019 (right). Imagery data are 

from USGS (A) and (C) and Google Earth Pro (B). 
All scale bars are 5 km. 




13 MAY 2022 • VOL 376 ISSUE 6594


RESEARCH | REPORTS 


Fig. 3. Tidal wetland change totals 
from 1999 to 2019. Panels show 
period (6700 km²; table S4). Transition events
tended to be spatially clustered with areas of 
large losses and gains and in many cases may 
be linked to the same drivers (8). More than 
55% of transitions (>3600 km²) involved colonization
of tidal flats by marshes or mangroves
and a further 27% consisted of transitions from 
mangrove to tidal marsh or vice versa (table S4). 
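
The transition figures quoted above can be cross-checked with simple arithmetic. The sketch below is only a back-of-envelope illustration: it uses the rounded percentages and the 6700 km² total from the text, so the derived areas are approximate, and the implied global extent is an inference from the rounded 1.9% share rather than a value reported here. All variable and function names are hypothetical.

```python
# Rounded values taken from the text; everything derived from them is approximate.
TRANSITION_AREA_KM2 = 6700.0     # total area that changed ecosystem type
SHARE_OF_ALL_WETLANDS = 0.019    # 1.9% of the world's tidal wetlands
SHARE_COLONIZATION = 0.55        # tidal flat colonized by marsh or mangrove
SHARE_MANGROVE_MARSH = 0.27      # mangrove <-> tidal marsh transitions


def area_for_share(total_km2: float, share: float) -> float:
    """Area (km^2) corresponding to a fractional share of a total."""
    return total_km2 * share


colonization_km2 = area_for_share(TRANSITION_AREA_KM2, SHARE_COLONIZATION)
mangrove_marsh_km2 = area_for_share(TRANSITION_AREA_KM2, SHARE_MANGROVE_MARSH)

# Back-of-envelope only: total tidal wetland extent implied by the rounded 1.9%.
implied_extent_km2 = TRANSITION_AREA_KM2 / SHARE_OF_ALL_WETLANDS

print(f"colonization of tidal flats: ~{colonization_km2:.0f} km^2")
print(f"mangrove/marsh transitions:  ~{mangrove_marsh_km2:.0f} km^2")
print(f"implied total extent:        ~{implied_extent_km2:.0f} km^2")
```

The first result is consistent with the ">3600 km²" stated for colonization of tidal flats.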

Our classifier accurately detected known 
events of coastal change across the three eco- 
system types. For example, the magnitude 9.2 
Aceh-Andaman earthquake on 26 December 
2004 caused up to 2.9 m of tectonic subsidence 
in the Andaman and Nicobar Islands, leading 
to land submergence and a loss of >90% of 
mangrove extent in some localities (Fig. 2C) 
(30). However, as for all earth observation- 
derived estimates of land cover change, there 
are limitations. These include the spatial resolution
of sensor data (which limits the ability
of our analysis to detect change in narrow 
linear features such as waterways), model 
uncertainty, errors of omission and commis- 
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sion, and a lack of polar coverage. Validation 
of our data products was effective in character- 
izing these uncertainties, which were propa- 
gated through our estimates of tidal wetland 
extent and change (tables S5 to S8). 

By simultaneously mapping tidal flats, tidal
marshes, and mangroves, this work enables
a synoptic view of change across three of the
world’s highly connected intertidal coastal
ecosystems. This approach offers an advantage
over single-ecosystem mapping studies as 
short- and long-term dynamic transitions 
between ecosystem types can cause consider- 
able apparent change in individual ecosys- 
tems. Although our study is unable to account 
for the impact of centuries of anthropogenic 
coastal transformation and related pressures 
(8), it has established an observational record 
of recent tidal wetland changes with prelimi- 
nary attribution of change drivers. Such infor- 
mation has the potential to promote objective 
monitoring of conservation and restoration 
efforts, assess the impacts of elevating pressures,
guide new studies of changing ecosystem
structure, function, and service provision in 
newly established tidal wetlands (8, 29), and 
improve our understanding of the resilience 
of tidal wetlands in the face of global change. 
In turn, it can support efforts to anticipate the 
future of global coastal environments and to 
develop adaptive responses to change. 
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ORGANIC CHEMISTRY 


Modular access to substituted cyclohexanes with kinetic stereocontrol


Yangyang Li, Yuqiang Li, Hongjin Shi, Hong Wei, Haoyang Li, Ignacio Funes-Ardoiz*, Guoyin Yin*


Substituted six-membered cyclic hydrocarbons are common constituents of biologically active 
compounds. Although methods for the synthesis of thermodynamically favored, disubstituted cyclohexanes 
are well established, a reliable and modular protocol for the synthesis of their stereoisomers is still 
elusive. Herein, we report a general strategy for the modular synthesis of disubstituted cyclohexanes 

with excellent kinetic stereocontrol from readily accessible substituted methylenecyclohexanes by the 
implementation of chain-walking catalysis. Mechanistically, the initial introduction of a sterically demanding 
boron ester group adjacent to the cyclohexane is key to guiding the stereochemical outcome. The 
synthetic potential of this methodology has been highlighted in late-stage modification of complex bioactive 
molecules and in comparison with current cross-coupling techniques. 


Three-dimensionally complex geometries
frequently confer improved biological
activities and physical features to drug
molecules compared to their flat bioisosteres
(1-3). Creating efficient synthetic protocols
to construct saturated rings in
a stereospecific manner has therefore stimu- 
lated great interest (4-6). Cyclohexanes bearing 
one substituent in the equatorial position and 
another in the axial position, namely the 1,2- 
cis, 1,3-trans, and 1,4-cis substitution patterns 
(Fig. 1A), represent a key scaffold in a wide 
range of molecules of pharmaceutical interest, 
such as linrodostat (7), an anticancer phase 3 
drug; cathepsin S inhibitor (8); oxysterols 
receptor LXR-beta (9); and histamine H3R 
ADS10227 (10) (Fig. 1B). Furthermore, they 
frequently exhibit better bioactivity than their 
thermodynamically favored isomers (10, 11).
Although studies toward the development of 
efficient methods for the construction of cyclo- 
hexane skeletons have not stopped since the 
discovery of the Diels-Alder reaction (12-14), 
modular and variable approaches for the 
synthesis of thermodynamically disfavored 
substituted cyclohexanes are still lacking. Con- 
ventional cross-couplings of cyclohexyl derivatives 
offer a reliable platform to prepare substituted 
cyclohexanes, but these methods uniformly 
favor thermodynamically stable stereoisomeric 
products, that is, the 1,2-trans, 1,3-cis, and 1,4- 
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trans isomers (15-17). Selective hydrogenation 
of trisubstituted alkenes (18-22) or hydrofunctionalization
of 1,1-disubstituted alkenes
(23) would be alternative pathways to access 
these compounds. However, controlling facial 
selectivity in these processes is still a chal- 
lenging task (Fig. 1C, left). Although transition 
metal-catalyzed hydrogenation of substituted 
arenes is particularly appealing as a route to 
all-cis-substituted cyclohexanes (24-26), this 
strategy does not afford access to cyclohexanes 
with 1,3-trans substituents. Given the preva- 
lence of cyclohexane scaffolds and the limita- 
tion of present synthetic methods, development 
of strategies for the modular synthesis of the 
thermodynamically disfavored disubstituted 
cyclohexanes in a highly stereoselective and 
efficient manner would not only greatly enrich 
the toolbox of organic synthetic chemists but 
also enlarge the compound library available 
for drug discovery. 

Inspired by the stereoselectivity induced by 
the size of the metal hydride reagent in the 
reduction of substituted cyclohexanones (27), 
as well as our previous studies on migratory 
difunctionalization of alkenes (28-30), we 
speculated that if a bulky group could be 
incorporated into the exocyclic position of a 
substituted methylenecyclohexane, it would 
generate a sterically demanding trisubstituted 
alkene capable of enhancing stereodiscrimi- 
nation by a catalyst (Fig. 1C, right). Subsequent 
coupling with an electrophile would afford the 
thermodynamically disfavored disubstituted 
cyclohexane with a good diastereomeric ratio 
(dr). Herein, we report our successful implementation
of this concept by in situ incorporation
of a transformable boron group through
the use of substituted methylenecyclohexanes 
as coupling partners in a nickel-catalyzed 
migratory carboboration reaction (Fig. 1D). 
The installation of a boron functional group 
not only plays an important role in controlling 
the stereoselectivity but also generates a key 
reactive site for further late-stage function- 
alization by means of well-known C-B bond 
functionalization chemistry. 

We commenced this study by choosing terminal
alkene 1, bis(pinacolato)diboron (B2pin2)
(2), and organohalide 3 as the model substrates. 
Ligand evaluation demonstrated that the che- 
moselectivity of this reaction was highly depen- 
dent on the ligand framework and that sterically 
hindered pyridinyl oxazoline (PyrOx) ligands 
could efficiently promote the formation of 1,4-cis-product
4 (Fig. 2A). After carefully examining
each reaction parameter, we identified the combination
of nickel acetylacetonate [Ni(acac)2]
and PyrOx ligand L6 as precatalyst, lithium 
methoxide (LiOMe) as base, and N-methyl-2- 
pyrrolidone (NMP)/1,4-dioxane (9:1) as solvent 
at 35°C as optimal reaction conditions (see sup- 
plementary materials for details), which furnished 
the 1,4-cis-product 4 with excellent cis/trans 
selectivity (>99:1). In contrast, when B2pin2
was replaced by pinacolborane (HBpin), anti- 
Markovnikov hydroalkylation product 8 was 
obtained in 30% yield, but with a poorer 
diastereoselectivity (cis/trans = 5:1) (Fig. 2B), 
while the high level of diastereoselectivity (cis/ 
trans = 83:1) was retained in the hydroalkylation 
reaction of alkenylboron 5 (Fig. 2C). The 
diminished reaction efficiency was caused by 
alkene isomerization, which was probably due 
to the formation of unexpected Ni(I)-H species
during the precatalyst activation (31). These
results clearly support the hypothesis that the 
initial incorporation of the pinacol boronic 
ester (Bpin) group plays an important role in 
increasing the stereoselectivity of this reaction. 
To further confirm this effect, density func- 
tional theory calculations (see supplementary 
materials for computational details) were car- 
ried out on the Ni-H migratory insertion step 
(Fig. 2D). After a careful transition-states analy- 
sis of the different equatorial and axial con- 
formers of the Ni-H insertion into the alkene 1, 
we found that cis migratory insertion is favored 
by 3.7 kcal/mol owing to the large steric hin- 
drance for the trans migratory insertion between 
the PyrOx ligand and the boron functional 
group. Despite the small size of hydrogen, the 
local steric hindrance in the transition state, 
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Fig. 1. Synthesis of substituted cyclohexanes with kinetic stereocontrol. 
(A) Representative bioactive molecules containing thermodynamically 
disfavored disubstituted cyclohexanes. Me, methyl; n-Bu, n-butyl; Ph, phenyl. 
(B) Thermodynamically favored versus disfavored disubstituted cyclohexanes. 


produced by the entire [Ni]-H system, favors 
the installation of the hydrogen atom in the 
equatorial plane where the Bpin group and 
PyrOx ligand are placed at longer distances 
(the same effect is observed for 1,3 insertion; 
see supplementary materials for details). In 
contrast, low stereodifferentiation is found 
in the absence of the Bpin group, with both 
transition states almost isoenergetic owing to 
the larger conformational flexibility of the PyrOx 
ligand in both axial and equatorial positions. 
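
As a rough consistency check on the computed barrier difference, transition state theory relates a free-energy gap between two competing transition states to a kinetic product ratio, ratio = exp(ΔΔG‡/RT). The sketch below is a simplified illustration, not the authors' computational workflow: it assumes the Ni-H migratory insertion alone sets the cis/trans ratio, uses the 35 °C reaction temperature from the optimized conditions, and all function names are hypothetical.

```python
import math

R_KCAL_PER_MOL_K = 1.987204e-3  # gas constant in kcal/(mol*K)


def selectivity_ratio(ddg_kcal_per_mol: float, temp_celsius: float) -> float:
    """Major:minor ratio for two competing pathways under kinetic control."""
    temp_kelvin = temp_celsius + 273.15
    return math.exp(ddg_kcal_per_mol / (R_KCAL_PER_MOL_K * temp_kelvin))


def barrier_gap(ratio: float, temp_celsius: float) -> float:
    """Inverse relation: barrier difference (kcal/mol) implied by a ratio."""
    temp_kelvin = temp_celsius + 273.15
    return R_KCAL_PER_MOL_K * temp_kelvin * math.log(ratio)


# 3.7 kcal/mol is the computed preference for cis insertion quoted in the text.
ratio_with_bpin = selectivity_ratio(3.7, 35.0)
# A 91:9 dr (the methyl-substituted case) expressed as an effective barrier gap.
gap_for_91_to_9 = barrier_gap(91.0 / 9.0, 35.0)

print(f"ratio for 3.7 kcal/mol at 35 C: {ratio_with_bpin:.0f}:1")
print(f"barrier gap for 91:9 dr at 35 C: {gap_for_91_to_9:.2f} kcal/mol")
```

Under these assumptions a 3.7 kcal/mol gap corresponds to a ratio of several hundred to one, which is in line with the >99:1 selectivity observed experimentally.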
We next focused on the generality of this 
nickel-catalyzed reaction, first with a series of 
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4-substituted methylenecyclohexanes (Fig. 3A). 
The corresponding secondary alkyl boronic 
esters could be obtained in moderate to good 
yields with exclusive 1,1-regio- and excellent 
1,4-cis-stereoselectivity (from 91:9 to >99:1 dr) 
when a diversity of substituents including 
alkyl, ester, and aryl groups (9 to 25 and 29 to 
31), as well as heteroatoms (26 to 28), were 
attached. Notably, even with a small methyl 
substituent (32), good diastereoselectivity 
(dr = 91:9) could be obtained. Moreover, the 
resulting cyclohexane frameworks (20 and 
23 to 25) have widespread applications in 


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(C) Previous studies and the conception of this work. FG, functional group; 
Cat, catalyst; G, group. (D) Execution of this work: modular synthesis of 
substituted cyclohexanes with kinetic stereocontrol via nickel-catalyzed 
migratory carboboration of substituted methylenecyclohexanes. 


liquid crystalline materials (32, 33). Variation 
of the substituent on the arene ring of the 
benzylic halide did not affect the efficiency or 
stereoselectivity. Next, we examined methylenecyclohexanes
bearing a 3-substituent and
obtained excellent 1,3-trans-diastereoselectivities 
(Fig. 3B). Furthermore, the 1,2-cis-cyclohexane 
derivatives were obtained from 2-substituted 
methylenecyclohexanes with the same high 
diastereoselectivity (Fig. 3C). Besides six- 
membered carbon rings, the heterocyclic 
piperidine derivatives (42 to 44) were also 
compatible with the developed protocol, and 
the corresponding kinetic products were obtained
with high facial selectivity (Fig. 3D).
The two-component coupling products or alkene
isomerization products make up most of the
remaining mass balance of these reactions (see
supplementary materials for details of representative
examples). The stereochemistry (13, 16,
22, 25, 27, 28, 32, and 41) was unambiguously
confirmed by single-crystal x-ray crystallography.
A wide range of flexible functional
groups including aryl bromide (13), esters
(16 and 19), phenol (21), cyano (22), ketal
(25), benzyl ether (26), imide (27), and amide
(28) were all well tolerated under the mild
reaction conditions.

Fig. 2. Reaction development and proof of concept. (A) Identification of the
optimal reaction conditions. R, radical group; tBu, tert-butyl; iPr, isopropyl.
(B) Hydroalkylation of terminal alkene. (C) Hydroalkylation of alkenylboron.
(D) Density functional theory computational studies on the role of the Bpin
group in the stereochemistry of this reaction. ΔΔG‡, activation Gibbs free
energy difference referred to the most stable transition state; TS, transition
state. The numbers within the ball-and-stick models represent the bond
lengths in angstroms.
[Panel labels recovered from the figure: standard conditions; 35 °C, 18 h; 8: 30%, cis/trans = 5:1; 4: 20%, cis/trans = 83:1; TS-trans-Bpin; TS-trans-H; TS-cis-H; ΔΔG‡ = 0.0 kcal/mol; ΔΔG‡ = 0.2 kcal/mol.]

Encouraged by the above results, we subsequently
investigated architecturally complex
molecules bearing a multiply substituted

methylenecyclohexane moiety. As illustrated 
in Fig. 3E, the complex molecules derived 
from octahydroindole (45), lithocholic acid 
(46), nootkatone (47), (—)-menthone (48), and 
drostanolone propionate (49), which bear diverse 
substitution patterns, were all transformed into 
the single product isomer by this protocol. In 
addition, (—)-caryophyllene oxide (50) bearing 
a terminal alkene attached to a nine-membered
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Fig. 3. Reaction scope. (A) Reactions of 4-substituted methylenecyclohexanes. (B) Reactions of 3-substituted methylenecyclohexanes. EtO, ethylene oxide; OMe, 
methoxy group; BocHN, tert-butyl carbamate; OiPr, isopropoxy; OEt, ethoxy functional group. (C) Reactions of 2-substituted methylenecyclohexanes. (D) Reactions of 
aza-heterocyclic alkenes. (E) Late-stage modification of complex molecules. The asterisk symbol denotes isolated yield of the borylated product. The single-dagger symbol
denotes isolated yield of the alcohol after H2O2 oxidation. The double-dagger symbol denotes isolated yield of the ketone after Dess-Martin oxidation. The section symbol indicates
that no relative stereoselectivity was observed for the carbon center bearing the Bpin group in this case. MCH, methylenecyclohexane. 
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Fig. 4. Diversifications of the representative products. 


ring, estradiene dione-3-keta (51) bearing a five- 
membered ring, and (+)-nootkatone (52) with 
an acyclic terminal alkene could also smoothly 
participate in this nickel-catalyzed reaction to 
deliver the corresponding products with excel- 
lent facial selectivity (fs > 95:5). These results 
demonstrate the compatibility and potential 
of this protocol in late-stage diversification of 
complex bioactive molecules. 

The incorporation of boron functional 
groups into molecules is particularly attractive 
because of their diverse downstream trans- 
formations (34). To further illustrate the syn- 
thetic potential of this method, additional 
diversifications of products were conducted. 
As illustrated in Fig. 4, using cis-13 as the
model substrate, the carbon-boron bond could 
be transformed into C-C(sp3) (53) or C-C(sp2)
(54) bonds by Matteson reactions (35), as well 
as a C-N bond by Liu’s amination protocol 
(55) (36). Moreover, a palladium-catalyzed 
cross-coupling of the aryl bromide moiety of- 
fered the amination product with the boronic 
ester intact (56) (37). Consequently, several 
structurally diverse cis-1,4-disubstituted cyclo- 
hexanes were obtained. In addition, bicyclic 
ether 57 and heterobicyclic compound 58 
were also readily prepared from the products 
(38, 39). 

To highlight the value of this protocol, a 
comparative study was performed using clas- 
sical cross-coupling methodologies and our 
method (fig. S1). In general, cyclohexanones 
(or cyclohexanols) are readily available com- 
pounds, which can undergo reduction, haloge- 
nation, and classical cross-couplings to furnish 
the thermodynamically favored products (fig. 
S1, left) (15, 40, 41). In contrast, the same 
ketones can undergo olefination followed by 
this protocol to generate their analogs with a 
reversed stereocenter (fig. S1, right). What is 
particularly noteworthy is that this observed 
stereochemistry not only holds in the simple 
disubstituted cyclic products (cis-8) but is also 
observed for multisubstituted cyclohexane mol- 
ecules (60c and 62c). This is complementary to 
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current hydrogenation technologies that form 
the thermodynamically favored products (18).

With this work, we have achieved a nickel- 
catalyzed chemoselective three-component 
coupling reaction, which provides expedient 
access to the framework of thermodynamically 
disfavored substituted cyclohexanes from easily 
accessible methylenecyclohexanes, B2pin2, and
benzyl halides. A wide scope of functional 
groups, including aryl bromide, ester, amine, 
amide, imide, and α,β-unsaturated ketone, were
all compatible with this transformation. 
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BIOMECHANICS 


Recovery mechanisms in the dragonfly 
righting reflex 


Z. Jane Wang*, James Melfi Jr., Anthony Leonardo


Insects have evolved sophisticated reflexes to right themselves in mid-air. Their recovery mechanisms 
involve complex interactions among the physical senses, muscles, body, and wings, and they must 
obey the laws of flight. We sought to understand the key mechanisms involved in dragonfly righting 
reflexes and to develop physics-based models for understanding the control strategies of flight 
maneuvers. Using kinematic analyses, physical modeling, and three-dimensional flight simulations, 
we found that a dragonfly uses left-right wing pitch asymmetry to roll its body 180 degrees 

to recover from falling upside down in ~200 milliseconds. Experiments on dragonflies with blocked
vision further revealed that this rolling maneuver is initiated by their ocelli and compound eyes. 
These results suggest a pathway from the dragonfly’s visual system to the muscles regulating wing 
pitch that underlie the recovery. The methods developed here offer quantitative tools for inferring
insects’ internal actions from their acrobatics, and are applicable to a broad class of natural and 


robotic flying systems. 


Dragonflies are some of the most ancient
insects, and they offer important clues
to the evolution of flight control strategies.
Among their acrobatic feats is an
ability to right themselves when falling
upside down (1). Aerial righting behavior has
been found in different animal taxa, including 
small mammals (2), flightless insects (3), rep- 
tiles (4), flying insects (5), birds (6), and bats 
(7), suggesting a convergent behavior that may 
shed light on the origin of flight (8). There has 
been much progress in the experimental char- 
acterization of aerial maneuvers (5, 9-17). In 
particular, high-speed video tracking has of- 
fered increasingly detailed flight kinematics 
since Marey’s classical films of a falling cat. 
Obtaining well-resolved three-dimensional (3D) 
flight kinematics is, however, nontrivial; care- 
ful experiments and calibrations are required. 
Understanding the physical mechanisms of 
righting further requires quantitative models 
of flight (18-20). Here, we developed experimental
and computational methods for analyzing
aerial maneuvers, including free-flight
simulations that account for both unsteady 
aerodynamics and body-wing inertial coupling, 
so as to tease out the key wing asymmetries 
that lead to the exhibited maneuver. These 
subtle wing modulations provide important 
clues to the internal control schemes for aerial 
maneuvers. Dragonflies control each wing sepa- 
rately with direct muscles. Our computations, 
together with statistical analyses, show that 
the observed rolling motion results primarily 
from wing pitch asymmetry. Our behavioral 
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experiments further show that vision is critical 
for eliciting the observed rolling response. 
These results imply pathways between the 
dragonfly’s visual system and the direct muscles 
that modulate the wing pitch (Fig. 1). 

To start, we set up an experiment in which 
the dragonflies were released from a magnetic 
tether from different orientations (Fig. 1 and 
movie S1). The dragonflies performed stereo- 
typical maneuvers (7), which depend on their 
initial body orientation (fig. S3), offering the 
possibility for controlled studies. To under- 


Roll to Recover 


/ Magneti 
tether 


Tracked at 4000fps 


Muscle 
torque 


stand righting mechanisms in response to free 
fall, it is necessary to design an experiment in 
which the righting reflex can be differentiated 
from other reflexes such as those involved in 
take-off or loss of leg contact with the perch. 
For example, a rapid retraction of a perch away 
from the dragonfly (21), to induce a fall, elicits
a reflex in the legs as well as a subsequent 
righting reflex; the two are interleaved in 
time and are difficult to discriminate. It is 
also critical to have reproducible initial start- 
ing conditions and to describe the dynamic 
states with proper definition of body trans- 
lation and rotation. Thus, we focused on 
experiments in which the legs were free from 
contact during the initial release to avoid a 
confounding tarsal reflex, and we analyzed 
the recovery maneuvers from falling upside 
down with nearly identical initial conditions 
(supplementary text). 

When falling upside down from a resting 
state, dragonflies rolled 180° to reorient. The 
righting episode lasted for about 200 ms (7 
dragonflies, 37 trials, 224 + 34 ms), which in- 
cludes a reaction time of 99 + 26 ms and active 
recovery of 125 + 28 ms (three to five wing 
beats). The body rolled at a nearly steady rate, 
with a sharp transition at the beginning and 
the end of the episode (Fig. 2). This repeatable 
maneuver allowed us to quantify flight patterns 
in detail. To obtain the time series of the wing 
and body kinematics at high spatial and tem- 
poral resolutions, we filmed flights using 
three high-speed video cameras at 4000 fps,
so that each wing beat (30 to 40 Hz) contained
about 100 points of measurement to
resolve the sub-wing-beat kinematics. All four
wings and the body were tracked. We used a
semiautomated tracking algorithm (22) and
monitored each frame to ensure that the
markers were tracked well, and selected flights
with all markers visible for full analyses. We
then reprojected the tracked markers back to
the original film to double-check the conversions
and the calibrations (movie S1). The
reconstructed 3D kinematics from the markers
are represented in the time series of 15 Euler
angles and three body positions at the estimated
center of mass (Figs. 2 and 3 and
figs. S4 to S10).

Fig. 1. The key actions along the recovery pathway and the corresponding analysis methods. The
dragonflies were released from a magnetic tether in an upside-down orientation and were filmed using three
high-speed cameras at 4000 fps. The dragonflies rolled 180° to reorient themselves by means of left and
right wing pitch asymmetry, which was elicited by two visual systems: the ocelli and the compound eyes.
To tease out the wing pitch asymmetry, we used computational simulation, based on the 3D kinematic data,
in addition to statistical analyses. To deduce the driving torque, we constructed a method based on the
Euler equations applied to the body rotation. To identify the initial response, we carried out behavioral
experiments with the dragonfly’s vision altered. These analyses revealed a pathway from the dragonfly’s
vision to body rotation during its recovery maneuver over ~200 ms.
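
The timing figures quoted for the righting maneuver and the filming setup can be checked against each other with a few lines of arithmetic. The sketch below only restates numbers given in the text (frame rate, wing-beat frequency range, mean reaction and recovery times); the function and constant names are hypothetical.

```python
FRAME_RATE_FPS = 4000.0        # camera frame rate stated in the text
WINGBEAT_HZ = (30.0, 40.0)     # wing-beat frequency range stated in the text
REACTION_MS = 99.0             # mean reaction time
ACTIVE_RECOVERY_MS = 125.0     # mean active recovery time


def frames_per_wingbeat(freq_hz: float, fps: float = FRAME_RATE_FPS) -> float:
    """Number of video frames that sample one wing beat."""
    return fps / freq_hz


def wingbeats_during(duration_ms: float, freq_hz: float) -> float:
    """Number of wing beats completed in a given duration."""
    return duration_ms / 1000.0 * freq_hz


total_ms = REACTION_MS + ACTIVE_RECOVERY_MS
print(f"mean righting episode: {total_ms:.0f} ms")
for freq in WINGBEAT_HZ:
    print(
        f"{freq:.0f} Hz: {frames_per_wingbeat(freq):.0f} frames per beat, "
        f"{wingbeats_during(ACTIVE_RECOVERY_MS, freq):.2f} beats in active recovery"
    )
```

The sum of the two mean phases reproduces the 224 ms episode length, and the frame counts agree with the "about 100 points of measurement" per wing beat.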

Measured wing motions showed a complex 
mixture of modulations in stroke angle, phase 
shift, inclination of the stroke plane, and 
wing pitch orientation, with variations from 
trial to trial. To detect the relevant shifts in 
wing motion, we first calculated the statisti- 
cal correlation between different wing varia- 
bles and body angular velocity, and found the 
highest correlation associated with wing pitch 
asymmetry (Fig. 3C). This was further checked 
by grouping the flights according to whether 
the body rolled in a clockwise or counter- 
clockwise direction. The reversal of pitch 
asymmetry was indeed correlated with the 
reversal of body rotation (Fig. 3D). These corre- 
lations suggest that the wing pitch asymmetry 
between the left and right wings led to the 
body roll. Still, they do not rule out the con- 
tributions of other asymmetries. In particular, 
wing stroke angles also exhibited apparent 
shifts during recovery (Fig. 3B), which warrants 
further study. 

The relationship between wing motion and 
body motion is nonlinear; it is determined by 
the Euler equations for body rotation, aero- 
dynamic forces, and the coupling among the 
inertial and aerodynamic effects. Each wing 
asymmetry can lead to a rotational mode that 
has a mixture of roll, yaw, and pitch. To under- 
stand the causal relationship between wing 
asymmetry and body rotation, we developed a 
flight model of four-winged insects (see sup- 
plementary materials). The model simulates 
the flight in six degrees of freedom, taking into 
account both unsteady aerodynamic forces (18) 
and instantaneous inertial coupling between 
wings and body motions (23), and was tested 
on hovering and maneuvering flight (figs. S13 
and S14). Using this flight model, we compared 
the effects of separately applying wing pitch or 
wing stroke asymmetries, the two leading can- 
didates for body roll (Fig. 3C). Computations 
showed that these two asymmetries elicited 
markedly different body rotations (Fig. 4). Wing 
pitch asymmetry induced predominantly a body 
roll (Fig. 4A and movie S2), confirming the
observed statistical correlation. Moreover, when 
the amount of wing asymmetry was varied, the
resulting changes in body rotation followed
proportionally, consistent with experimental
data (fig. S18). In contrast, stroke amplitude
asymmetry induced predominantly a yaw motion
of the body (Fig. 4B). The roll motion was
substantially smaller and insufficient to return
the dragonfly to its normal orientation
(movie S3). These results further show that
dragonflies rely on a wing pitch asymmetry
between the left and right wings to drive roll
recovery. We further note that the averaged
motion of the left and right wings during the
recovery flight accelerates the dragonfly forward,
and is thus different from the typical hovering
wing motion that would have accelerated the
dragonfly downward, given its initial upside-down
orientation. It is this subtle interplay
between wing asymmetry and average wing
motion that leads to the difference in dragonflies’
and flies’ maneuvers, further shown computationally
(fig. S18).

Fig. 2. Body kinematics and a method for estimating the driving torque. (A) Definition of the body Euler
angles. (B) Measured body Euler angles during one flight reveal the dominant rolling motion during the
recovery sequence. (C) Zoomed-in view of the rolling angle at the end of the rolling motion. The analysis of
the shape of this transitional curve leads to an estimate for the driving torque. (D) The roll angle from
different recovery sequences during the transition follows a single curve after a rescaling in time and angle,
consistent with the model.
[Panel labels recovered from the figure: (B) Yaw, Pitch, Roll versus t (ms); (C) Experiment and Model versus t (ms), λ = 13.5 ms; (D) Normalized time.]

To infer the magnitude of the driving torque, 
we constructed a quick method that can be 
applied to the experimental data without re- 
sorting to computations. The trajectory of the 
roll angle contains a linear segment followed 
by a sharp transition (Fig. 2B). Because the roll 
dominates, the Euler equation for the roll angular
rate is simplified to $I\dot{\Omega} \approx -\beta\Omega + \tau$, where $\tau$
is the driving torque and $-\beta\Omega$ is the effective
damping torque due to aerodynamic drag
induced by the sum of symmetric wing motion
and body rotation (24). Angular velocity is relatively
constant during recovery, $\dot{\Omega} \approx 0$; therefore,
$\tau = \beta\Omega$. To find $\beta$, we note that at the tail
end of the roll, the dynamics are governed by
$I\dot{\Omega} \approx -\beta\Omega$, or $\Omega = \Omega_0 \exp(-t/\lambda)$, where $\lambda = I/\beta$
is the time constant. Therefore, the roll angle is
given by $\Psi(t) - \Psi(t_0) = \Omega_0 \lambda [1 - \exp(-t/\lambda)]$,
where $\lambda$ can be estimated from the data. In
the case shown, $\lambda \approx 13.5$ ms, which is roughly
half a wing beat. This, together with body
inertia, gives an estimate of the torque that
rotates the dragonfly, $\tau \approx \beta\Omega = I\Omega/\lambda \approx 21\ \mu\mathrm{N\cdot m}$.
Comparing it to the nominal torque on each
wing during steady state, $\tau_0 = MgL/4$, gives a
relative scale. For a typical body weight M of 
300 mg, with the net force acting at L ~ 20 mm 
away from the wing root, the driving torque 
requires an increase or decrease of one-third 
of the nominal torque on each wing. 
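
The estimate above can be reproduced numerically. The following Python sketch is illustrative only: it recovers the time constant from a synthetic, noise-free exponential decay of the roll rate (the initial rate `OMEGA0` is a made-up value, not a measurement), then compares the quoted driving torque with the nominal per-wing torque MgL/4. Sharing the driving torque equally among four wings is an assumption made here to arrive at the one-third figure.

```python
import math

def fit_time_constant(times, rates):
    """Estimate the decay time constant from roll rates sampled at `times`.

    Assumes rate(t) = rate0 * exp(-t / lam), so ln(rate) is linear in t
    with slope -1/lam; the slope comes from ordinary least squares.
    """
    n = len(times)
    logs = [math.log(r) for r in rates]
    t_mean = sum(times) / n
    y_mean = sum(logs) / n
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in zip(times, logs))
    sxx = sum((t - t_mean) ** 2 for t in times)
    return -sxx / sxy  # lam = -1 / slope

# Synthetic tail of a roll: hypothetical initial rate, quoted time constant.
OMEGA0 = 40.0          # rad/s, illustrative value only
LAMBDA_TRUE = 13.5e-3  # s, time constant quoted in the text
times = [k * 2.0e-3 for k in range(16)]
rates = [OMEGA0 * math.exp(-t / LAMBDA_TRUE) for t in times]
lam = fit_time_constant(times, rates)

# Nominal per-wing torque from the quoted body mass and moment arm.
M = 300e-6  # kg (300 mg)
G = 9.81    # m/s^2
L = 20e-3   # m
tau_nominal = M * G * L / 4  # N*m per wing
tau_drive = 21e-6            # N*m, driving torque quoted in the text

# Assumption: the driving torque is shared equally by the four wings.
fraction_per_wing = (tau_drive / 4) / tau_nominal

print(f"fitted time constant: {lam * 1e3:.1f} ms")
print(f"nominal torque per wing: {tau_nominal * 1e6:.1f} uN*m")
print(f"change per wing / nominal: {fraction_per_wing:.2f}")
```

With these inputs the nominal torque is about 14.7 µN·m per wing, and the per-wing change is about 0.36 of it, in line with the one-third stated above.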

The wing asymmetry needed for maneuvers
is elicited from upstream sensory signals. How 
does a dragonfly perceive its fall? Insects are 
equipped with many sensors to detect their 
orientation and movement in space (25). In 
general, they use multiple feedback systems to 
ensure robust and versatile flight control (25-27). 
Possible candidates include their antennae, visual
systems, legs, and other mechanosensors. We 
investigated which of these drives the righting
reflex. Dragonflies, like other insects, have

Fig. 3. Wing kinematics and statistics show a strong correlation between
wing pitch asymmetry and body roll angular velocity. (A) The definition of
the wing Euler angles: the stroke angle (φ), the wing pitch angle relative to the
inclined stroke plane (ψ), and the deviation angle of the leading edge from
the stroke plane (θ). (B) The measured wing Euler angles during one flight.
Additional kinematics are shown in figs. S4 to S10. Blue, left wing; red, right wing; 
solid line, forewing; dashed line, hindwing. (C) The correlation between the 
body roll angular velocity and the left-right wing asymmetries from multiple 


two visual systems: the compound eyes and 
the ocelli. The ocelli are three simple eyes that 
can serve as a horizon detector and trigger a 
head rotation toward the light (28, 29). We
performed behavioral experiments on dragon- 
flies with blocked vision to investigate the role 
of vision in the righting reflex. The reaction 
time statistics show that vision is critical for 
eliciting the observed rolling behavior. When 
their visual systems were intact, dragonflies 
reacted in ~100 ms and had recovered normal
orientation in ~220 ms (7 dragonflies, 37 trials,
reaction time 99 ± 26 ms) (movie S1). When
the ocelli were blocked, the reaction time became
longer (3 dragonflies, 13 trials, reaction
time 158 ± 93 ms), and none of the 13 recovery
attempts regained level flight within
the filming volume (movie S4). When the compound
eyes were blocked, the recovery flight
was more erratic, occasionally with the head
plunging down first, followed by a body pitch
assisted by the abdominal bending motion
(2 dragonflies, 14 trials, reaction time 156 ±
63 ms, total roll recovery time 272 ± 35 ms)
(movie S5). When both the ocelli and the 
compound eyes were blocked, dragonflies 
showed a mixture of behavior. In many cases,
they did not flap their wings, falling like
leaves within the filming volume (3 dragon- 
flies, 35 trials) (movie S6). In the cases where 
they flapped wings, none of the flapping mo- 
tion elicited the rolling response (3 dragon- 
flies, 13 trials) (movie S7). The wings typically flapped
after the body passively pitched, or just before 
reaching the ground (movie S6). Although 
these experiments do not rule out the partic- 
ipation of other mechanosensory pathways 
during the recovery maneuver, they suggest 
that the visual system is the first and dom- 
inant response pathway, and the combina- 
tion of ocelli and compound eyes ensures an 
accurate recovery. Related work (30) using a 
setup similar to ours (1) also found that vision
plays a main role in hoverflies’ aerial righting. 

Dragonflies often perform mid-air righting 
maneuvers during their prey-capture flights 
(5) and must recover their normal position 
from different orientations. To right from an 
upside-down orientation, a dragonfly can either 
roll or pitch 180°; we have found that it almost 
always does so using the energetically less costly 
roll maneuver. Moreover, dragonflies of different 
species show the same rolling dynamics, which 
suggests that this is a generic behavior (supple- 


experiments. Each data point represents the averaged value over a wing beat 
for the front (blue) and rear (red) pair of wings (5 dragonflies, 55 wing beats). 
(D) Details of the wing pitch during a wing beat in three scenarios: rolling 
right, rolling left, and forward flight. The solid line represents the averaged 
value; the colored region denotes one standard deviation. Top panels, front 
wings; bottom panels, rear wings. The reverse in wing pitch asymmetry 
leads to a reverse in the rolling direction, confirming the strong correlation 


mentary text). When released from different 
initial orientations, the righting mode transi- 
tioned from rolling to pitching motion (fig. S3).
In more than 300 experiments under normal 
vision, dragonflies always actively flapped their 
wings to right themselves. The passive mech- 
anism suggested in a related study (27) refers 
to the body pitching motion of anesthetized 
dragonflies with motionless wings, released 
from a perch with an initial pitch velocity (sup- 
plementary text). In awake dragonflies, passive 
body pitch can occur, before wing flapping, as 
a result of either an initial leg kick or a back- 
pitched orientation, but is insufficient for 
righting. The subsequent flapping motion 
does not necessarily explore the passive body 
rotation. It can produce a rolling motion 
when the body is back-pitched (movie S8) or 
can pitch the body in the opposite direction 
of the initial pitch (movie S9). These specific 
cases, as well as the general maneuvers that 
involve a mixture of roll, yaw, and pitch, show 
unequivocally that dragonflies use active mech- 
anisms to right themselves. 

Our results point to a pathway from the 
dragonfly’s visual system to the direct muscles 
regulating wing pitch that drive body roll when

Fig. 4. Computational model compares the effect of wing pitch and wing stroke asymmetries on body rotation. (A, C, and D) Wing pitch asymmetry leads to
roll recovery, as seen in the experiments. (B) Wing stroke asymmetry, by contrast, leads to a body rotation with a complex mixture of roll, yaw, and pitch and 
underpredicts the roll seen in experiments. Simulation methods and additional tests are in the supplementary materials.

Cross-biome synthesis of source versus sink limits
to tree growth

Uncertainties surrounding tree carbon allocation to growth are a major limitation to projections of forest
carbon sequestration and response to climate change. The prevalence and extent to which carbon
assimilation (source) or cambial activity (sink) mediate wood production are fundamentally important
and remain elusive. We quantified source-sink relations across biomes by combining eddy-covariance
gross primary production with extensive on-site and regional tree ring observations. We found
widespread temporal decoupling between carbon assimilation and tree growth, underpinned by
contrasting climatic sensitivities of these two processes. Substantial differences in assimilation-growth
decoupling between angiosperms and gymnosperms were determined, as well as stronger decoupling
with canopy closure, aridity, and decreasing temperatures. Our results reveal pervasive sink control over
tree growth that is likely to be increasingly prominent under global climate change.


Forest ecosystems currently constitute a
net carbon (C) sink that offsets ~25% of
yearly anthropogenic C emissions, thus
actively mitigating climate change (1). C
allocation to aboveground wood biomass 
is the largest contributor to vegetation C 
storage over climate-relevant time scales. How- 
ever, wood C allocation is poorly understood 
and is a major uncertainty for projections of 
future forests’ C storage potential (2). The 
common representation of wood growth as 
a linear function of C assimilation has been 
identified as a major structural limitation of 
current vegetation models (3, 4). The develop- 
ment of improved C allocation schemes cur- 
rently lacks a solid empirical and mechanistic 
basis (5). Thus, there is an urgent need to 
illuminate the relationship between C assim- 
ilation and tree growth. 
A fundamental debate revolves around 
the degree to which C assimilation via photo- 
synthesis (source limitation) versus direct 


environmental limitations to cambial cell 
development (sink limitation) controls wood 
growth (6). As reflected by C allocation schemes 
in the vast majority of vegetation models, 
source limitation has been the dominant 
paradigm for decades (4). Yet a growing body 
of literature indicates that cambial activity is 
typically more sensitive than photosynthesis 
to a range of environmental conditions, in- 
cluding low water availability, temperature, 
and nutrient availability (7-11). The preva- 
lence of source versus sink limitations to tree 
growth has far-reaching implications for forest 
dynamics under climate change, because these 
processes will likely respond differently to 
global change (6-9), potentially shifting C 
allocation away from the stem. Substantial 
indirect evidence supports the hypothesis that 
C sink limitations may be particularly impor- 
tant in cold, dry, and late-successional forests. 
For example, elevated concentrations of non- 
structural C (e.g., starch and sugars) are frequently 


observed in colder environments or during 
drought (8, 12). Additionally, free air CO₂ enrichment
(FACE) experiments tend to show
that increasing CO₂ concentration improves
tree growth in early-stage forests but often not
in mature forests, perhaps because of stronger
nutrient limitations (13-15). But the relatively
small scale and replication of FACE experiments,
especially in mature forests, prevent
general conclusions regarding the linkage between
C source and sink dynamics in trees.
Colocated assessments of gross primary 
productivity (GPP; e.g., by eddy-covariance) and 
tree growth theoretically enable evaluation of 
the coupling between tree C assimilation and 
growth increment. Past studies adopting such 
an approach were nonetheless limited by data- 
set size (site number <5) and yielded contrasting 
findings, with no clear explanation of observed 
differences (16-21). The advent of large-scale,
long-term networks of flux towers measuring 
C exchange across a diverse assemblage of 
biomes, in combination with a growing num- 
ber of both on-site and global tree ring data- 
sets, opens new opportunities to characterize 
C source-sink relationships at larger temporal 
and spatial scales. Here, we compiled a new 
dataset comprising eddy-covariance GPP re- 
cords at 78 forest flux sites (table S1), together 
with on-site tree ring width chronologies at 
a subset of 31 sites ($\mathrm{RW}_{\text{on-site}}$; table S2) as well
as 1800 nearby regional ring width chronologies
($\mathrm{RW}_{\text{region}}$). GPP and RW records were
detrended to remove low-frequency signals 
(e.g., stand structure; tree age and size) and 
were aggregated such that records were rep- 
resentative of year-to-year variations of stand 
C assimilation and aboveground woody growth, 
respectively (22). This C assimilation and tree 
growth dataset extends across most of Europe 
and North America, encompassing a variety of 
forested biomes from semi-arid to boreal, and 
representing both angiosperm and gymno- 
sperm tree species (Fig. 1, fig. S1, and table S3). 
We used this dataset to (i) quantify the strength 
of tree C source-sink relationships across
biomes, (ii) identify the seasonality of these
relationships, and (iii) explore their environmental
drivers.

Fig. 1. Spatial distribution of gross primary production (GPP) and regional ring width ($\mathrm{RW}_{\text{region}}$) sites used in this study. Crosses, $\mathrm{RW}_{\text{region}}$ sites; circles, GPP
sites. Circle size denotes the number of $\mathrm{RW}_{\text{region}}$ site × year observations associated with each flux tower. GPP sites that further include on-site RW are shown in

We first characterized C source and sink 
relationships at the regional scale by statistically 
accounting for the decrease of the correlation 
between GPP and $\mathrm{RW}_{\text{region}}$ ($r_{\text{region}}$) with increasing
geographic and climatic distances, as
well as with an index of species dissimilarity 
between sites (22) (fig. S2). As expected from 
reported tree growth synchrony over large dis- 
tances (23), we observed sustained correlations 
up to ~500 km. We thus built on this widespread 
ecological feature to derive robust regional 
estimates of tree C assimilation and growth 
correlation, $r_{D=0}$, for theoretical colocated sites
of identical climate and species composition 
(i.e., spatial distance, climatic distance, and 
species dissimilarity of 0), integrating over 
multiple time scales. We then complemented 
regional-scale analyses with paired GPP and 
on-site tree ring correlations ($r_{\text{on-site}}$; see annual
GPP and RW series in fig. S3). The latter dataset 
has a lower sample size than the regional 
network but is model-free and therefore re- 
duces the risk of methodological artifacts. 
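
The idea of extrapolating regional correlations to a hypothetical colocated site can be illustrated with a short Python sketch. It is not the authors' eq. S1, which is in the supplementary materials and is not reproduced here. As a stand-in, it assumes that the correlation declines linearly with a single distance measure and reads off the intercept at distance 0. The function name and all sample values are hypothetical.

```python
def zero_distance_correlation(distances, correlations):
    """Intercept of an ordinary least-squares line r = a + b * distance.

    The intercept a is the correlation predicted for a colocated site
    (distance 0) under the assumed linear decay.
    """
    n = len(distances)
    d_mean = sum(distances) / n
    r_mean = sum(correlations) / n
    sxy = sum((d - d_mean) * (r - r_mean)
              for d, r in zip(distances, correlations))
    sxx = sum((d - d_mean) ** 2 for d in distances)
    slope = sxy / sxx
    return r_mean - slope * d_mean

# Made-up GPP-RW correlations for tower/chronology pairs at increasing distance.
distances_km = [50.0, 150.0, 250.0, 350.0, 450.0]
correlations = [0.35, 0.30, 0.25, 0.20, 0.15]

r_zero = zero_distance_correlation(distances_km, correlations)
print(f"extrapolated correlation at zero distance: {r_zero:.3f}")
```

The study accounts for geographic distance, climatic distance, and species dissimilarity together; a single predictor is used here only to keep the sketch short.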

Both on-site and regional correlations showed 
an overall weak association between tree C 
assimilation and growth, with $r_{\text{on-site}}$ and
$r_{D=0}$ reaching maxima of 0.26 and 0.38,
respectively (Fig. 2, A and B). The observed
difference between on-site and regional estimates
could be offset by setting species dissimilarity
to the average encountered for
$\mathrm{RW}_{\text{on-site}}$, resulting in a maximum regional
correlation of 0.27 (22). $\mathrm{RW}_{\text{region}}$ observations
partially build on the International Tree-Ring
Data Bank, where sampling is often biased
toward dominant and climate-sensitive trees
(24). However, we find that this is unlikely to
be an issue here, as dominant trees account for
most of stand GPP and we statistically corrected
for differences in climate (22). Overall,
similar regional and on-site results show the
suitability of regional RW data to quantify
local GPP-RW correlations and broad agreement
between the two approaches, both of
which suggest a substantial decoupling between
C assimilation and tree growth across
multiple biomes.

Fig. 2. Temporal structure of GPP versus RW correlations. (A) Seasonal on-site correlations ($r_{\text{on-site}}$)
(from 1 to 12 months). (B) Regional-based estimates of null distance correlations ($r_{D=0}$) modeled by
eq. S1 (see fig. S2 for an illustration of the 12-month case from current-year January). Significant correlation
values are displayed on top of corresponding cells (all p values < 0.05, except for boldface: p < 0.01).

Fig. 3. Spatial variations and environmental drivers of GPP versus RW correlations. (A) Effect of
biome on on-site correlations ($r_{\text{on-site}}$) observed in the period with highest correlation average (previous
November through current October; nonsignificant). Boxes represent the median and first and third quartiles.
Whiskers represent 1.5 times the interquartile range. Dots represent individual r values; dot size is
proportional to the underlying number of observations. (B) Effect of stand structure and climatic variables
on current- and previous-year regional-based estimates of null distance correlations ($r_{D=0}$). Error bars denote
SE. All effects are highly significant [p < 0.001: gymnosperm proportion, species richness, mean annual
climatic water deficit (MACWD), mean annual temperature (MAT)] except for that of leaf area index (LAI) on
current-year correlations (nonsignificant).

On-site and regional GPP-RW correlations 
exhibited a similar temporal structure (22), 
with correlation magnitude increasing with 
the length of the GPP integration period and 
maximum correlations being found at the 
10- and 12-month scales for $r_{\text{on-site}}$ and $r_{\text{region}}$,
respectively (Fig. 2). This supports the often- 
implicit assumption that annual tree ring in- 
crements are most strongly related to annual 
carbon assimilation (27). Overall, RW was best 
correlated to GPP integrated over the period 
spanning previous-year September or November 
to current-year August, indicating a short 
temporal lag between C assimilation and tree 
growth, consistent with a previous study (20). 
This result suggests that despite estimated 
low C source limitation of tree growth overall, 
excess photosynthates are stored over winter 
(following radial growth cessation) and are 
allocated to the next year’s growth. This phe- 
nomenon is often cited as a potential explana- 
tion for delayed climatic effects on tree growth 
and growth autocorrelation (25, 26). Analysis 
of multiyear trends (table S4) nonetheless indicates
weak association of RW and GPP at this
scale, contrary to the hypothesis that C storage 
might lead to the convergence of tree growth 
and C assimilation over the long term (27). 
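
The window analysis described above, in which ring width is correlated against GPP integrated over periods of different length and timing, can be sketched in Python as follows. This is a minimal illustration on synthetic data, not the authors' code (theirs is on Zenodo): the function names, the seasonal profile, and the yearly anomalies are all hypothetical, and ring width is constructed to equal GPP summed from previous-year September to current-year August so that this window correlates perfectly by design.

```python
import math

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def integrated_gpp(monthly_gpp, year, end_month, length):
    """Sum GPP over `length` months ending at `end_month` of `year`.

    Windows longer than `end_month` reach back into the previous year.
    """
    total = 0.0
    last = year * 12 + (end_month - 1)  # running month index
    for k in range(length):
        y, m0 = divmod(last - k, 12)
        total += monthly_gpp[(y, m0 + 1)]
    return total

def window_correlation(monthly_gpp, ring_width, end_month, length):
    """Correlate yearly ring width with GPP integrated over one window."""
    years = sorted(ring_width)
    gpp = [integrated_gpp(monthly_gpp, y, end_month, length) for y in years]
    rw = [ring_width[y] for y in years]
    return pearson(gpp, rw)

# Synthetic monthly GPP: a fixed seasonal cycle plus a yearly anomaly.
SEASONAL = [1, 1, 2, 4, 7, 9, 10, 9, 6, 3, 1, 1]
ANOMALY = {2000 + i: a for i, a in enumerate(
    [0.0, 1.0, 0.4, 1.6, 0.9, 2.0, 0.3, 1.2, 0.7, 1.8, 0.5])}
monthly_gpp = {(y, m): SEASONAL[m - 1] + ANOMALY[y]
               for y in ANOMALY for m in range(1, 13)}

# Synthetic ring width: GPP from previous September to current August.
ring_width = {y: integrated_gpp(monthly_gpp, y, 8, 12)
              for y in range(2001, 2011)}

r_annual = window_correlation(monthly_gpp, ring_width, end_month=8, length=12)
r_summer = window_correlation(monthly_gpp, ring_width, end_month=8, length=3)
print(f"12-month window ending in August: r = {r_annual:.2f}")
print(f" 3-month window ending in August: r = {r_summer:.2f}")
```

Scanning `end_month` and `length` over all combinations would give a grid of correlations of the kind summarized in Fig. 2.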
We found large spatial variations in the 
strength of GPP-RW coupling (Fig. 3). Weighted 
deciles of maximum on-site $r$ ranged from -0.08
to 0.60, consistent with previously reported val- 
ues (16-21). These spatial variations imply a 
range of source versus sink limitations. We 
estimate that because of approximations and 
measurement errors, RW-GPP correlations 
between 0.7 and 0.9 would be expected under 
strong source control of tree growth (22). The 
high end of the observed correlation range 
(0.6 < $r_{\text{on-site}}$ < 0.9: 10% of observations) thus
appears reflective of substantial source limi- 
tation of tree growth at the corresponding sites, 
whereas the majority of sites display evidence 
consistent with sink limitations. Although we 
did not observe a biome effect on on-site correlations,
regional-scale $r$ was significantly
related to several environmental factors (Fig. 3B).
Specifically, gymnosperm proportion had a positive
effect on current-year $r_{\text{region}}$ but a negative
one on previous-year $r_{\text{region}}$; this suggests that
gymnosperm growth relies more directly on 
current-year and less on previous-year C as- 
similation than angiosperms, reflecting fun- 
damental physiological differences between 
these two clades. A small but positive effect of 
species richness on $r_{\text{region}}$ suggests a link
between species diversity and C use efficiency 
(i.e., the ratio between net and gross primary 
production), which may arise as a result of 
increased complementarity with structural 
and functional heterogeneity (28). Decreasing 
$r_{\text{region}}$ with increasing leaf area index indicates
that closed-canopy forests, which under a given
climate tend to be older and more nutrient- 
limited than open-canopy forests, are prone 
to stronger decoupling between C source and 
sink activity. This result agrees with observations
that CO₂ growth fertilization tends to
fade in older, nutrient-limited forests (5). Last,
$r_{\text{region}}$ was found to be positively related to site
temperature and water availability, consistent 
with known biophysical controls of cambial 
activity and the ensuing prediction that sink 
limitations are stronger under colder and drier 
conditions (6-9). These combined results draw 
a clear picture that increasing resource limita- 
tion, aridity, and low temperatures promote C 
source-sink decoupling across a broad range of 
biomes. 
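
One common way to reason about how measurement error caps an observable correlation is the classical attenuation relation, in which the observed correlation equals the true correlation scaled by the square root of the product of the two series' reliabilities. The Python sketch below applies it. Whether the authors derived their expected range of 0.7 to 0.9 this way is not stated here (their approach is in the supplementary materials), and the reliability values are invented for illustration.

```python
import math

def attenuated_correlation(true_r, reliability_x, reliability_y):
    """Observed correlation after attenuation by measurement error.

    Each reliability is the fraction of a series' variance that is signal
    rather than error (1.0 means error-free).
    """
    return true_r * math.sqrt(reliability_x * reliability_y)

# Hypothetical reliabilities for the GPP and RW series under perfect coupling.
for rel_gpp, rel_rw in [(0.9, 0.9), (0.8, 0.8), (0.7, 0.7)]:
    r_obs = attenuated_correlation(1.0, rel_gpp, rel_rw)
    print(f"reliabilities {rel_gpp:.1f}, {rel_rw:.1f} -> observed r = {r_obs:.2f}")
```

Under this relation, even a perfectly source-controlled system would show correlations below 1 once both series carry error, which is why observed values well under the expected range point away from strong source control.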

Finally, decoupling of C assimilation and 
tree growth was further revealed by diverging 
climate sensitivities of these two processes 
(22) (Fig. 4). As anticipated from C assimila- 
tion and wood formation literature, GPP and 
RW both responded positively to temperature 
and water availability but were weakly cor- 
related with photosynthetically active radiation 
(29, 30). However, their seasonal variability 
differed markedly, indicating that fundamen- 
tally different physiological processes may 
limit C assimilation and tree growth. GPP 
responded mostly to spring and fall temper- 
atures as well as to summer water availability, 
suggesting an important role of temperature- 
triggered leaf phenology controlling annual 
GPP (31). In contrast, RW appeared to be most
strongly related to year-round water availa- 
bility, with a weak positive temperature effect 
peaking in summer. This agrees with previ- 
ous observations that tree growth is primarily 
and increasingly water-limited in the study 
regions (29) and is consistent with the central 
role of cell turgor in controlling cambial cell 
division and expansion (7, 17). Overall, this 
analysis shows the large but contrasting cli- 
mate sensitivity of the tree growth and photo- 
synthesis proxies used here. This is contrary 
to the expectation that RW and GPP would 
have weaker but similar climate sensitivity if 
low RW-GPP were due primarily to large mea- 
surement errors. These results instead strongly 
suggest that weak control of C assimilation 
over tree growth is underpinned by fundamen- 
tally contrasting source and sink processes with 
diverging environmental sensitivities (6). 

Taken together, our results provide consist- 
ent evidence for the pervasive influence of 
nonphotosynthetic processes on tree radial 
growth. This conclusion has major implica- 
tions for projections of forest dynamics and 
feedbacks with the global C cycle and climate 
change, as most global vegetation models 
essentially simulate forest productivity and C 
sequestration as a linear function of C as- 
similation (3, 4). Because, compared to C 
assimilation, sink processes are relatively more
sensitive to water availability and less to temperature
constraints (Fig. 4), as well as not
directly dependent on atmospheric CO₂ concentration,
unaccounted-for and widespread
sink limitations could lead to overestimating
the positive effect of warming and CO₂ fertilization
while underestimating the negative
effect of increasing water stress on forest productivity.
Overall, accounting for sink limitations
on tree growth may lower projections
of future forest C sequestration in many regions
and could thus potentially compromise
forests’ potential for climate change mitigation.
Hence, our results underscore that incorporation
of sink-limited carbon allocation
schemes in global vegetation models is urgently
needed (3, 4).

Fig. 4. GPP and $\mathrm{RW}_{\text{region}}$
photosynthetically active
scale over the period
1990-2015 (to the extent
of series span). Error bars
denote SE. *p < 0.05,
**p < 0.01, ***p < 0.001.

Our results nonetheless indicate a certain 
degree of interaction between C source and sink 
activities, as suggested by the weak but signif- 
icantly positive correlations observed between 
GPP and RW, as well as their temporal and 
spatial variations. Such dynamic coupling 
between C assimilation and tree growth po- 
tentially reconciles contrasting observations 
of the prevalence of source versus sink limitations
(15) and provides a bridge between
current source-centered representations of tree 
growth and sink-driven schemes. Variations in 
the prevalence of source versus sink limitations 
to tree growth further highlight the importance 
of understanding their drivers (5). Our findings 
indicate that across biomes, the occurrence 
of sink limitations is highly consistent with 




known biophysical controls of cambial cell 
division, notably turgor-driven growth. Because 
turgor is a central mechanism of growth across 
scales and has a large potential for the inte- 
gration of several relevant processes as well 
as parameter-parsimonious upscaling (32), the 
turgor-driven growth framework appears 
to be a promising way to progress toward 
developing mechanistic sink-limited schemes 
in vegetation models. 

Key remaining uncertainties concern whether 
our results can be generalized to other biomes 
such as tropical forests, which are central to the 
global C cycle, and the dynamic nature of source 
and sink interactions. Likewise, characterizing 
the degree of C source and sink decoupling at 
decadal to centennial scales is relevant to cli- 
mate change but currently remains elusive be- 
cause of the temporal depth of C assimilation 
measurements. Source-sink decoupling over 
both short and longer time scales implies less 
C limitation of tree growth. Weak C limita- 
tion of tree growth under certain conditions 
nonetheless raises the question of the fate of 
excess C. Closing trees’ C budget and elucidat- 
ing drivers of C allocation to different sinks, 
specifically stem versus underground growth 
and C storage, thus emerges as a critical way 
forward (14). 


REFERENCES AND NOTES 

1. P. Friedlingstein et al., Earth Syst. Sci. Data 12, 3269-3340 (2020).
2. T. A. M. Pugh et al., Biogeosciences 17, 3961-3989 (2020).
3. A. D. Friend et al., Ann. For. Sci. 76, 49 (2019).
4. S. Fatichi, S. Leuzinger, C. Körner, New Phytol. 201, 1086-1095 (2014).
5. F. Babst et al., Trends Plant Sci. 26, 210-219 (2021).
6. C. Körner, Curr. Opin. Plant Biol. 25, 107-114 (2015).
7. T. C. Hsiao, Annu. Rev. Plant Physiol. 24, 519-570 (1973).
8. B. Muller et al., J. Exp. Bot. 62, 1715-1729 (2011).
9. R. L. Peters et al., New Phytol. 229, 213-229 (2021).
10. I. Cornut et al., For. Ecol. Manage. 494, 119275 (2021).
11. A. Cabon, R. L. Peters, P. Fonti, J. Martinez-Vilalta, M. De Caceres, New Phytol. 226, 1325-1340 (2020).
12. G. Hoch, C. Körner, Glob. Ecol. Biogeogr. 21, 861-871 (2012).
13. T. Klein et al., J. Ecol. 104, 1720-1733 (2016).
14. M. Jiang et al., Nature 580, 227-231 (2020).
15. A. P. Walker et al., New Phytol. 229, 2413-2445 (2021).
16. ... Change Biol. 12, 1378-1389 (2006).
17. F. Babst et al., New Phytol. 201, 1289-1303 (2014).
18. N. Delpierre, D. Berveiller, E. Granda, E. Dufrêne, New Phytol. 210, 459-470 (2016).
19. C. Pappas et al., Agric. For. Meteorol. 290, 108030 (2020).
20. A. Teets et al., Agric. For. Meteorol. 249, 479-487 (2018).
21. M. Lempereur et al., New Phytol. 207, 579-590 (2015).
22. See supplementary materials.
23. M. del Rio et al., For. Ecol. Manage. 479, 118587 (2021).
24. S. Klesse et al., Nat. Commun. 9, 5336 (2018).
25. R. Zweifel, F. Sterck, Front. For. Glob. Change 1, 9 (2018).
26. A. Gessler, K. Treydte, New Phytol. 209, 1338-1340 (2016).
27. C. M. Gough, C. S. Vogel, H. P. Schmid, H. B. Su, P. S. Curtis, Agric. For. Meteorol. 148, 158-170 (2008).
28. S. Mensah, R. Veldtman, A. E. Assogbadjo, R. Glelé Kakai, T. Seifert, Ecol. Evol. 6, 7546-7557 (2016).
29. F. Babst et al., Sci. Adv. 5, eaat4313 (2019).
30. N. Delpierre et al., Agric. For. Meteorol. 154-155, 99-112 (2012).
31. J. Xia et al., Proc. Natl. Acad. Sci. U.S.A. 112, 2788-2793 (2015).
32. A. Potkay, T. Hölttä, A. T. Trugman, Y. Fan, Tree Physiol. 42, 229-252 (2022).

ACKNOWLEDGMENTS 


We thank C. Hanson, S. Wharton, R. Brooks, and S. Klesse for 
contributing data to this study, as well as contributors at the 
International Tree-Ring Data Bank, FLUXNET, and AmeriFlux.
Funding: Supported by USDA National Institute of Food and 
Agriculture, Agricultural and Food Research Initiative Competitive 
Program, Ecosystem Services and Agro-Ecosystem Management, 
grant 2018-67019-27850 (A.C., S.A.K., and W.R.L.A.); the David and 
Lucile Packard Foundation and NSF grants 1714972, 1802880, 
2044937, and 2003017 (W.R.L.A.); NSF Ecosystem Science 
cluster grant 1753845, USDA Forest Service Forest Health 
Protection Evaluation Monitoring program grant 19-05, and
DOE Environmental System Science program grant DE-SC0022052
(S.A.K.); Arctic Challenge for Sustainability II grant
JPMXD1420318865 (M.U.); USDA National Institute of Food and 
Agriculture, Agricultural and Food Research Initiative 
Competitive Program grant 2017-67013-26191 (J.T.M.); and DOE 
Office of Biological and Environmental Research grant DE- 
SC0010611 and NSF Directorate for Biological Sciences grant 
1241851 (D.J.M.). Funding for the AmeriFlux data portal was 
provided by the US Department of Energy Office of Science. 
Author contributions: Conceptualization: A.C., W.R.L.A. 
Methodology: A.C., W.R.L.A., S.A.K. Data contributions: All 
co-authors. Investigation: A.C., W.R.L.A., S.A.K. Visualization: 
A.C. Funding acquisition: W.R.L.A. Writing-original draft: A.C., 
W.R.L.A., S.A.K. Writing-review and editing: All co-authors. 
Competing interests: The authors declare that they have no 
competing interests. Data and materials availability: All 
processed data used for the analyses are available on Dryad 
(DOI: 10.5061/dryad.15dv41nzt) and the code is available on 
Zenodo (DOI: 10.5281/zenodo.6033963). 


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.abm4875 
Materials and Methods
Figs. S1 to S5 

Tables S1 to S4 

References (33-55) 


Submitted 21 September 2021; accepted 16 February 2022 
10.1126/science.abm4875 


13 MAY 2022 * VOL 376 ISSUE 6594 761 


RESEARCH | REPORTS 


SOLAR CELLS 


Scalable processing for realizing 21.7%-efficient 
all-perovskite tandem solar modules 


Ke Xiao, Yen-Hung Lin, Mei Zhang, Robert D. J. Oliver, Xi Wang, Zhou Liu, Xin Luo, Jia Li,
Donny Lai, Haowen Luo, Renxing Lin, Jun Xu, Yi Hou, Henry J. Snaith*, Hairen Tan*


Challenges in fabricating all-perovskite tandem solar cells as modules rather than as single-junction 
configurations include growing high-quality wide-bandgap perovskites and mitigating irreversible 
degradation caused by halide and metal interdiffusion at the interconnecting contacts. We demonstrate 
efficient all-perovskite tandem solar modules using scalable fabrication techniques. By systematically 
tuning the cesium ratio of a methylammonium-free 1.8-electron volt mixed-halide perovskite, we
improve the homogeneity of crystallization for blade-coated films over large areas. An electrically 
conductive conformal “diffusion barrier” is introduced between interconnecting subcells to improve the 
power conversion efficiency (PCE) and stability of all-perovskite tandem solar modules. Our tandem 
modules achieve a certified PCE of 21.7% with an aperture area of 20 square centimeters and retain 75% 
of their initial efficiency after 500 hours of continuous operation under simulated 1-sun illumination. 


Monolithic all-perovskite tandem solar cells show great promise for large-scale
photovoltaic (PV) applications with the advantage of low-cost solution processing
(1-3). However, certified power conversion efficiencies (PCEs), which
can reach up to 26.4% (4, 5), have only been 
achieved in small-area devices with lab-scale 
spin-coating techniques that limit scalability. 
To enable large-area fabrication of perovskite 
films, deposition techniques such as spray 
coating (6), inkjet printing (7), blade coating 
(8, 9), slot-die coating (10, 11), and vacuum 
evaporation (12) have been reported. Solution- 
based fabrication routes all involve solvent 
engineering to modulate the crystallization 
dynamics, but the solvent systems that are in 
use now for scalable coating of state-of-the-art 
~1.5-eV-bandgap perovskite films are incom- 
patible with those of the ~1.8-eV-bandgap 
wide-bandgap (WBG) perovskites that are 
needed for all-perovskite tandem solar mod- 
ules (10). The higher bromide concentration in 
WBG perovskites leads to variations in crys- 
tallization kinetics, and precursor solutions 
are limited by the low solubility of lead and 
cesium bromide salts (13). These constraints 
hinder the scalable fabrication of high-quality 
WBG perovskites for all-perovskite tandem 
solar modules (14, 15).

Another challenge in fabricating perovskite
solar modules is linked to the reaction of 
halides and metal electrodes at the intercon- 
necting subcells. The interdiffusion between 
the perovskite absorber and metal creates deep 
defect states at either the interface or the bulk 
of the perovskites (16-19). Unlike small-area 
perovskite solar cells (PSCs), perovskite solar 
modules require a three-step laser or me- 
chanical scribing (namely P1, P2, and P3) to 
connect the subcells in series (20-25). The 
direct contact between perovskites and metal 
electrodes at the interconnecting areas between 
cells leads to subsequent halide-metal inter- 
diffusion and limits the performance and 
stability of the modules (26). If the tandem-cell 
recombination junction is a highly conductive 
transparent conducting oxide (27-29), which 
is often the case, then metal-to-recombination 
layer contact will also lead to short-circuiting 
of one or both of the subjunctions. Injecting 
two-dimensional (2D) barrier materials, such 
as 2D nanostructured graphitic carbon nitride, 
between subcells can inhibit such interdiffusion (26). However, the poor electronic
properties of these 2D materials cause an undesirable, large hysteresis in the solar
modules. Furthermore, adding a 2D diffusion
barrier layer between subcells complicates the 
overall fabrication process and reduces the 
geometric fill factor (GFF) because it requires 
two more independent processes (spin-coating 
and injecting) and necessitates a much wider 
space gap between subcells. A one-step pro- 
cess to deposit a thin conformal diffusion 
barrier (CDB) would not only improve cost ef- 
fectiveness but also reduce the cell-to-module 
efficiency gap. 

In this work, we controlled the homogeneity 
of crystallization in WBG perovskites over 
large areas by tuning the content of mono- 
valent inorganic cation cesium. This strategy 
enabled the fabrication of 1-cm² all-perovskite


tandem solar cells with a steady-state PCE of 
24.8% through scalable processing techniques. 
A CDB consisting of atomic-layer-deposited
SnO₂ (ALD-SnO₂) served as both the vertical
electron extractor and the lateral diffusion
barrier between interconnecting subcells. The
CDB inhibited halide-metal interdiffusion and
avoided the reaction between the perovskites and
the metal electrode. Using the ALD-SnO₂-based
CDB, we demonstrated all-perovskite tandem
solar modules with a certified PCE of 21.7%
(aperture area of 20.25 cm²). Encapsulated
tandem solar modules retained 75% of their
initial performance after aging for 500 hours
at maximum power point (MPP) operation
under simulated 1-sun illumination in
ambient conditions.

We first attempted to blade coat the WBG perovskite films with a composition of
Cs₀.₂FA₀.₈PbI₁.₈Br₁.₂ (where FA is formamidinium), which was used for the spin-coating
process in our previous works (2, 3). The relatively volatile solvent 2-methoxyethanol
(2-ME) favors rapid deposition of a uniform perovskite film with low bromide content
(21, 30). However, undesirable crystal precipitation occurred when adding WBG perovskite
precursors into a 2-ME and dimethyl sulfoxide (DMSO) mixed solvent, given the limited
solubility of lead and cesium bromide salts in 2-ME (fig. S1). A stable and transparent
precursor solution was obtained when the WBG perovskite precursors were added to the
coordinating N,N-dimethylformamide (DMF) and DMSO mixed solvent (fig. S1). We used
several solvent-quenching methods, such as vacuum flashing, hot casting, and gas
quenching, to remove excess DMF and DMSO solvent after the blade-coating step, but none
of these methods resulted in dense and uniform perovskite layers (fig. S2). We found
that optimizing the blading parameters (that is, blade speed, quenching gas pressure,
and the gap between the blade and the substrate) was not enough to obtain high-quality,
uniform Cs₀.₂FA₀.₈PbI₁.₈Br₁.₂ perovskite films (fig. S3). We also attempted to blade-coat
films with neat FA cation and a composition of FAPbI₁.₈Br₁.₂; however, the resulting
films exhibited an obvious nonperovskite δ phase (fig. S4).

We found that the crystal nucleation rate could be controlled by finely tuning the Cs
content (denoted as x in the CsₓFA₁₋ₓPbI₁.₈Br₁.₂ formula) in conjunction with a
gas-assisted blade-coating technique (Fig. 1A). During the gas-quenching step, the
nucleation process was initiated under supersaturation conditions, which was evident
from the observation that the wet perovskite film turned brown with increased Cs
content (fig. S5). We observed the enlargement of grain sizes and the flattening of the
film surface with higher Cs incorporation (up to 35 mol %) into the perovskite. For the
film where x is equal to 0.35 (Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂), we observed the largest grain
sizes with a uniform surface, as seen in scanning electron microscopy (SEM) and atomic
force microscopy (AFM) images (Fig. 1B and fig. S6). It should be noted that the grains
or grain boundaries seen in SEM and AFM observations are only features of morphological
domains or domain boundaries (31). The Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂ perovskite film also
exhibited the highest crystallinity, as indicated by the x-ray diffraction patterns
(Fig. 1C and fig. S7A).

Fig. 1. Fabrication of CsₓFA₁₋ₓPbI₁.₈Br₁.₂ WBG perovskite films using blade
coating. (A) Schematic illustration of gas-assisted blade coating. (B and C) SEM
images (B) and x-ray diffraction patterns (C) of CsₓFA₁₋ₓPbI₁.₈Br₁.₂ perovskite films. The
scale bars in the SEM images are 1 μm. a.u., arbitrary units. (D) J-V curves of champion

The grazing-incidence wide-angle x-ray scattering and the corresponding diffraction
mottling intensity profiles integrated along the ring with scattering vector (q) equal
to 1.0 Å⁻¹ are shown in fig. S8. For blade-coated Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂ perovskite
films, the diffraction mottling at the azimuth angles of 45° (135°) and 90° became
distinct compared with the ambiguous diffraction mottling at 60° (120°) for spin-coated
Cs₀.₂FA₀.₈PbI₁.₈Br₁.₂ perovskite films. This new pattern implied that a distinct
stacking orientation developed along with (100) crystallographic planes for the WBG
perovskite films deposited through gas-assisted blade coating. Large crystal grains
grown perpendicular to the substrate throughout the whole film were observed in
blade-coated perovskite films (fig. S8). By contrast, smaller grains with multiple
grain boundaries throughout the film were formed for spin-coated films. We observed
that the grain orientation and crystallinity of blade-coated films were strongly
related to the Cs content, which had only a weak impact on spin-coated films (fig. S9).

We noted a linear increase in the bandgap of CsₓFA₁₋ₓPbI₁.₈Br₁.₂ perovskites when x was
increased up to 0.35. However, nonlinearity in the optical bandgap was seen when x was
greater than 0.35 (fig. S10), which may have been caused by phase segregation in the
perovskite films (fig. S7B). Perovskites with x less than 0.3 suffered from
light-induced phase segregation under high illumination intensities (i.e., 10 suns),
which was evident from the multiple emission peaks observed in the steady-state
photoluminescence (PL) spectra (fig. S11). The reduced photoinduced phase segregation
in the film with x equal to 0.35 could be due to the reduced lattice strain with a
higher Cs ratio (32) and/or the change in thermodynamics related to the intermixing of
ions or ion migration within the crystal lattice (33, 34).

CsₓFA₁₋ₓPbI₁.₈Br₁.₂ PSCs. (E) QFLS calculated from the PLQY for the respective
perovskite films and the perovskites with transport layer stacks that were investigated
in the study. The Shockley-Queisser radiative limit and the experimental Voc are
plotted for each composition. non-rad. rec. loss, nonradiative recombination loss.

We then evaluated the effect of Cs content on the optoelectronic properties and PV
performance of positive-intrinsic-negative (p-i-n)-structured WBG PSCs. The
Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂ perovskite films showed the longest carrier lifetime, as
determined by PL decay (fig. S12). The performance distribution, external quantum
efficiency (EQE), and steady-state power output of various CsₓFA₁₋ₓPbI₁.₈Br₁.₂-based
devices are summarized in fig. S13, suggesting an optimal composition of
Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂. The champion Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂ device achieved a PCE of
17.2%, with an open-circuit voltage (Voc) of 1.266 V, a short-circuit current density
(Jsc) of 16.8 mA cm⁻², and a fill factor (FF) of 80.9% (Fig. 1D and table S1). The
blade-coated devices (x = 0.35) exhibited comparable performance to the spin-coated
ones (x = 0.20; fig. S14), and the spin-coated devices with various Cs contents showed
similar performance (fig. S15).
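
As a quick consistency check on the figures of merit quoted above, the PCE follows from the product of Voc, Jsc, and FF divided by the incident power density. The sketch below assumes standard 1-sun illumination of 100 mW cm⁻²; the function and variable names are illustrative and do not come from the original work.

```python
def power_conversion_efficiency(v_oc_v, j_sc_ma_cm2, fill_factor, p_in_mw_cm2=100.0):
    """Return the PCE in percent from Voc (V), Jsc (mA cm^-2), and FF (0 to 1)."""
    # V * mA cm^-2 gives mW cm^-2, the same unit as the incident power density.
    p_max_mw_cm2 = v_oc_v * j_sc_ma_cm2 * fill_factor
    return 100.0 * p_max_mw_cm2 / p_in_mw_cm2


# Champion blade-coated WBG cell (x = 0.35), values as quoted in the text.
wbg_pce = power_conversion_efficiency(v_oc_v=1.266, j_sc_ma_cm2=16.8, fill_factor=0.809)
print(f"WBG champion PCE: {wbg_pce:.1f}%")
```

With the quoted inputs, the product reproduces the reported 17.2% to within rounding.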

Fig. 2. Full fabrication of all-perovskite tandem solar cells using scalable
techniques. (A) Configuration schematic of an all-perovskite tandem device
fully fabricated using scalable techniques. The processing technique for each
layer is indicated to the left. (B) Cross-sectional SEM image of the all-perovskite
tandem device. (C) J-V curves of the champion tandem solar cell (aperture

To understand how the Cs content (x = 0.2 to 0.4) affects the performance of
blade-coated cells, we measured the transient photovoltage decay under open-circuit
conditions (fig. S16). The photovoltage-decay lifetime τ of the Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂
device (86 μs) was longer than that of the Cs₀.₂FA₀.₈PbI₁.₈Br₁.₂ device (31 μs),
indicating reduced recombination when the ratio of Cs was increased from 0.2 to 0.35.
To further elucidate the Voc improvement of the PSCs with various Cs ratios, we
investigated the PL quantum yield (PLQY) of isolated perovskite thin films and device
stacks that had both charge-transporting layers present (fig. S17). These measured PLQY
values were then used to derive the quasi-Fermi level splitting (QFLS) in the
respective perovskite materials and p-i-n stacks (Fig. 1E). Notably, the calculated
QFLS values of the p-i-n stacks agreed well with the Voc of the WBG PSCs and suggested
that nonradiative recombination losses were reduced by increasing Cs to an x of 0.3 to
0.35.
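
The PLQY-to-QFLS step can be sketched with the commonly used relation QFLS = QFLS_rad + k_B T ln(PLQY), where QFLS_rad is the splitting in the radiative limit. This is an assumption about the method, since the main text does not state the exact expression, and the numerical inputs below are hypothetical placeholders rather than measured values.

```python
import math

K_B_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def nonradiative_loss_ev(plqy, temperature_k=300.0):
    """Nonradiative QFLS loss (eV) implied by a PLQY between 0 (exclusive) and 1."""
    if not 0.0 < plqy <= 1.0:
        raise ValueError("PLQY must lie in the interval (0, 1]")
    return -K_B_EV_PER_K * temperature_k * math.log(plqy)


def qfls_ev(qfls_rad_ev, plqy, temperature_k=300.0):
    """QFLS (eV) = radiative-limit QFLS minus the nonradiative loss."""
    return qfls_rad_ev - nonradiative_loss_ev(plqy, temperature_k)


# Hypothetical inputs for illustration only (not values from the paper).
example_qfls_rad_ev = 1.50
example_plqy = 0.01
print(f"loss = {nonradiative_loss_ev(example_plqy) * 1000:.0f} meV")
print(f"QFLS = {qfls_ev(example_qfls_rad_ev, example_plqy):.3f} eV")
```

Under this relation every tenfold drop in PLQY costs roughly 60 meV of QFLS at room temperature, which is why a higher PLQY in the full p-i-n stack is read as lower nonradiative recombination loss.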

We then investigated the uniformity of Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂ perovskite films that
were blade coated over a large area (6-cm-by-6-cm substrate; fig. S18). We patterned
eight solar cells over a single substrate with each device pixel having an aperture
area of 2.2 cm by 1.125 cm (2.475 cm²), as illustrated in the inset of fig. S19A. All
eight devices showed nearly identical device performance with a minor PCE standard
deviation of 0.03% (fig. S19 and table S2). The narrowly distributed device performance
indicates the effectiveness of the blade-coating method to achieve large-area
uniformity for WBG perovskite films. By contrast, the devices distributed on the
6-cm-by-6-cm substrate by spin coating exhibited larger variations in performance
among eight pixels (PCE standard deviations of 0.36 and 0.63% for
Cs₀.₃₅FA₀.₆₅PbI₁.₈Br₁.₂ and Cs₀.₂FA₀.₈PbI₁.₈Br₁.₂, respectively). It should be noted
that tuning the Cs content is likely not the only way to achieve high-quality, uniform
WBG perovskite films, so further work in optimizing the blading parameter space,
precursor solvent, and additives may also lead to similar or even better-quality films.

area of 1.05 cm²) deposited on a 2.5-cm-by-2.5-cm substrate. The inset shows
the PCE distribution of 19 devices, where the boxes and whiskers represent
the standard deviation and the maximum and minimum of the distributions,
respectively. (D) EQE curves of the champion device. The front and back subcells
have integrated Jsc values of 15.6 and 16.6 mA cm⁻², respectively.

Fig. 3. All-perovskite tandem modules. (A) Schematic diagram of the
structure of the series-connected all-perovskite tandem module with CDB to
prevent ion diffusion. HTL, hole transport layer. (B) J-V curves of different
module configurations. BCP, bathocuproine. (C) Relationship between the FF,
GFF, and efficiency of modules. The arrows indicate GFF (purple), FF (blue),

To realize tandem solar cells with scalable manufacturing techniques, we used blade
coating to replace spin coating in all of the solution-based processes, including the
fabrication of narrow bandgap (NBG) perovskite film (MA₀.₃FA₀.₇Pb₀.₅Sn₀.₅I₃, where MA is
methylammonium) and hole transport layers (Fig. 2A). All other material layers were
deposited through either thermal evaporation or ALD, which are both scalable processes
already used in PV manufacturing. The blade-coated NBG PSCs (0.049 cm²) delivered a
champion PCE of 19.0% (steady-state PCE of 19.0%) and good reproducibility (fig. S20).
Monolithic all-perovskite tandem solar cells, consisting of a ~400-nm-thick WBG
perovskite and a ~950-nm-thick NBG perovskite (Fig. 2B), were fabricated entirely with
scalable techniques. The current density-voltage (J-V) and EQE curves of the
best-performing tandem device with an aperture area of 1.05 cm² are presented in
Fig. 2, C and D, respectively. An average PCE of 23.7 ± 0.7% was obtained from 19
devices (inset of Fig. 2C). The champion device had a PCE of 24.8% from the reverse
scan, with a Voc of 2.025 V, a Jsc of 15.4 mA cm⁻², and a FF of 79.4%. A highest
efficiency of 25.1% (average PCE of 24.2 ± 0.6% from 44 devices) was obtained for
0.049-cm² tandem devices (fig. S21). The relatively modest PCE difference between the
1.05- and 0.049-cm² cells suggests good scalability for all-perovskite tandem solar
cells.
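
In a two-terminal tandem the subcells are connected in series, so to a first approximation the device current is capped by the lower of the two subcell currents. The sketch below applies that simplification (which ignores luminescent coupling and shunt paths) to the EQE-integrated values quoted in the Fig. 2D caption and compares the result with the measured tandem Jsc. The helper name is illustrative.

```python
def series_limited_jsc(subcell_jscs_ma_cm2):
    """In a series connection the smallest subcell current density sets the limit."""
    return min(subcell_jscs_ma_cm2)


# EQE-integrated subcell values from the Fig. 2D caption (mA cm^-2).
front_jsc = 15.6  # wide-bandgap front subcell
back_jsc = 16.6   # narrow-bandgap back subcell

limiting_jsc = series_limited_jsc([front_jsc, back_jsc])
current_mismatch = abs(back_jsc - front_jsc)

# Champion 1.05-cm^2 tandem J-V values quoted in the text.
measured_jsc = 15.4
tandem_pce = 2.025 * measured_jsc * 0.794  # mW cm^-2, equal to percent at 100 mW cm^-2

print(f"limiting subcell Jsc: {limiting_jsc:.1f} mA cm^-2")
print(f"subcell mismatch:     {current_mismatch:.1f} mA cm^-2")
print(f"tandem PCE from J-V:  {tandem_pce:.1f}%")
```

The front (WBG) subcell is the current-limiting one in this estimate, and its integrated value lies close to the measured tandem Jsc of 15.4 mA cm⁻².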

We fabricated all-perovskite tandem solar modules on 6-cm-by-6-cm substrates. The
limited long-term stability and efficiency degradation of perovskite tandem modules
are attributed in part to the interfacial halide-metal electrode reaction at the
P2-scribed regions between the interconnecting subcells (Fig. 3A). To address this
challenge, we devised an electrically conductive CDB by depositing ~10-nm-thick
ALD-SnO₂ after P2 scribing to avoid interdiffusion and reaction between the perovskite
and metal electrode (figs. S22 and S23). The CDB layer not only reduces the module
manufacturing complexity but also enables a much narrower P2-scribed region (thus
higher GFF and module efficiency) compared with injecting a wide insulating 2D barrier
material at the interconnecting regions (26). We noted that another ALD-SnO₂
protective layer (~10 nm) deposited on C₆₀ before the P2 scribing was essential to
enable the P2 process to be performed under ambient conditions (Fig. 3B). We speculate
that this compact protective layer protected the NBG perovskite from oxidation during
exposure to ambient air (2).

In comparison, all of the modules that did not have the ALD-SnO₂ protective layer
before performing the P2 process showed inferior performance. After the P2 process,
the ALD-SnO₂-based CDB layer prevented direct contact between the metal electrode and
the perovskite absorber at the P2-scribed regions. It also prevented direct contact
between the metal electrode and the conductive poly(3,4-ethylenedioxythiophene)
polystyrene sulfonate (PEDOT:PSS) in the recombination layer, which could otherwise
lower the shunt resistance. The current-voltage measurements, performed at the
ITO/ALD-SnO₂/Ag junction (where ITO is indium tin oxide), showed good ohmic contact
with low vertical resistance (fig. S24), suggesting that this semiconducting CDB layer
allowed effective electrical interconnection between subcells. The CDB layer improved
both the efficiency and reproducibility of the all-perovskite tandem solar modules
(fig. S25A).

and PCE (red). (D) J-V curves of the champion all-perovskite tandem module
(aperture area of 20.25 cm², four subcells in series). The inset shows photos
of the front side (left) and back side (right) of the module. (E) Summary
of publicly reported, independently certified PCEs of perovskite solar modules
(aperture area >10 cm²).

We investigated how the subcell width would affect the performance of tandem modules.
Increasing the subcell width allows for a higher GFF and thus potentially higher
module efficiency, but this strategy also introduces a higher series resistance and
hence reduces the FF. We fabricated modules that had three to seven subcells with a
fixed total area, corresponding to subcell widths ranging from 15 to 6.4 mm
(fig. S25B). The GFF is calculated to decrease linearly from 95.0 to 88.3% as the
number of subcells increases (Fig. 3C and fig. S26). The optimal performance was
achieved for the four-subcell tandem modules with a width of 11.25 mm and a GFF of
93.3% (Fig. 3C and table S3). We expect that laser scribing (35, 36), instead of
mechanical scribing, for P2 and P3 processing could allow for an even higher GFF and
thus higher module efficiency, given that the subcell widths of tandems that are used
herein are much larger than those reported in single-junction perovskite solar modules
(table S4).
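
The geometric trade-off described above can be illustrated with a simple model in which every subcell loses a fixed dead width to the P1-P3 interconnection. Two inputs are assumptions rather than reported values: a 45-mm aperture width (taking the 20.25-cm² aperture to be square) and a dead width of 0.75 mm per subcell, chosen because it matches the GFF values quoted in the text.

```python
APERTURE_WIDTH_MM = 45.0  # assumption: square 20.25 cm^2 aperture (45 mm x 45 mm)
DEAD_WIDTH_MM = 0.75      # assumption: inferred from the quoted GFFs, not reported


def subcell_pitch_mm(n_subcells):
    """Width allotted to each subcell, including its interconnection region."""
    return APERTURE_WIDTH_MM / n_subcells


def geometric_fill_factor(n_subcells):
    """Fraction of the aperture that is photoactive for n series-connected subcells."""
    return 1.0 - n_subcells * DEAD_WIDTH_MM / APERTURE_WIDTH_MM


for n in range(3, 8):
    pitch = subcell_pitch_mm(n)
    gff_pct = 100.0 * geometric_fill_factor(n)
    print(f"{n} subcells: pitch {pitch:.2f} mm, GFF {gff_pct:.1f}%")
```

Under these assumptions the model gives 15 mm and 95.0% for three subcells, 11.25 mm and 93.3% for four, and about 6.4 mm and 88.3% for seven, which is the linear decrease described in the text. A narrower dead width, such as laser scribing might provide, would raise the GFF at every subcell count.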

Fig. 4. Stability of all-perovskite tandem solar modules. (A) Continuous MPP tracking
of an encapsulated tandem module over 500 hours under full simulated AM1.5G
illumination [100 mW cm⁻², light-emitting diode (LED) simulator] in ambient air
Ag 3d XPS spectra of the perovskite surfaces after peeling off the C₆₀/ALD-
center of the exposed perovskite surface after peeling off the C₆₀/ALD-SnO₂/Ag stacks.

We fabricated 50 all-perovskite tandem solar modules with CDB and with a subcell width
of 11.25 mm, which showed an average PCE of 20.9% (fig. S25A). The performance of
single-junction WBG and NBG modules is summarized in fig. S27 and table S5. The
champion tandem module exhibited a high PCE of 22.5% under reverse scan, with a Voc of
8.137 V, a Jsc of 3.60 mA cm⁻², and a FF of 76.8% (Fig. 3D). Considering a GFF of
93.3%, the active-area efficiency of the tandem module reached 24.1% (fig. S28). The
tandem module showed a minor hysteresis between the reverse and forward scans (22.5
versus 22.1%) and a steady-state PCE of 22.5% at a MPP voltage of 6.7 V measured over
3 min (fig. S29). A single tandem module under illumination could steadily power a
cooling fan (fig. S29 and movie S1), and six tandem modules connected in parallel
could charge a smartphone (movie S2). One module was sent to an accredited independent
PV calibration and measurement laboratory (Japan Electrical Safety and Environment
Technology Laboratories) for certification. The module delivered a certified PCE of
21.7% (fig. S30), which has been included in recent solar cell efficiency tables
(version 59) (5). The certified 21.7% PCE of the tandem module surpasses those of
single-junction perovskite solar modules with areas >10 cm² (Fig. 3E and table S4).
Our tandem cell (~1 cm²) made using scalable fabrication delivered a performance
comparable to that of tandem cells made by spin coating (table S6).
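
The relation between the aperture-area and active-area figures for the champion module can be checked with a few lines of arithmetic. The sketch below uses only numbers quoted in the text. The per-subcell voltage and current density are derived here as a consistency check and are not values reported by the authors, and the variable names are illustrative.

```python
N_SUBCELLS = 4                # four subcells connected in series
MODULE_VOC_V = 8.137          # reverse-scan module Voc
MODULE_JSC_MA_CM2 = 3.60      # module Jsc, normalized to the aperture area
GFF = 0.933                   # geometric fill factor of the four-subcell design
PCE_REVERSE_PCT = 22.5        # aperture-area PCE, reverse scan
PCE_FORWARD_PCT = 22.1        # aperture-area PCE, forward scan

# Series connection: subcell voltages add, so each contributes about a quarter.
voc_per_subcell_v = MODULE_VOC_V / N_SUBCELLS

# Active-area efficiency removes the interconnection dead area from the denominator.
active_area_pce_pct = PCE_REVERSE_PCT / GFF

# Each subcell occupies 1/N of the aperture, and only a GFF fraction of it is active.
subcell_jsc_active_ma_cm2 = MODULE_JSC_MA_CM2 * N_SUBCELLS / GFF

# Relative difference between reverse and forward scans.
hysteresis_pct = 100.0 * (PCE_REVERSE_PCT - PCE_FORWARD_PCT) / PCE_REVERSE_PCT

print(f"Voc per subcell:        {voc_per_subcell_v:.3f} V")
print(f"active-area PCE:        {active_area_pce_pct:.1f}%")
print(f"subcell Jsc (active):   {subcell_jsc_active_ma_cm2:.1f} mA cm^-2")
print(f"scan hysteresis:        {hysteresis_pct:.1f}% relative")
```

The derived per-subcell values (about 2.03 V and 15.4 mA cm⁻²) are close to those of the 1.05-cm² champion cell, which suggests that the cell-to-module gap is dominated by the GFF and the lower FF rather than by losses in voltage or current.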

The encapsulated modules with CDB maintained their initial PCE after dark storage for
1778 hours under ambient conditions with a relative humidity of ~40% (fig. S31). We
also tested the operating stability of encapsulated modules in ambient conditions
under constant simulated 1-sun AM1.5 global (G) solar spectrum illumination. The
module with CDB maintained 75% of its initial PCE (22.1%; fig. S32) after 500 hours of
MPP tracking, whereas the module without CDB degraded to less than 50% of its initial
PCE after 20 hours (Fig. 4A). We reasoned that the performance degradation of the
module with CDB was not mainly attributable to the WBG perovskite; the single-junction
WBG module could maintain 95% of its initial PCE after 450 hours of MPP tracking
(fig. S33). Two possible reasons why the tandem modules with CDB degrade faster than
the single-junction WBG module are that (i) Au, used as a recombination layer, could
have diffused into the NBG perovskite layer, leading to the formation of defect states
at the perovskite interface or in the bulk (18), and (ii) the reaction at the
PEDOT:PSS/NBG perovskite interface may have led to poorer charge extraction (37).

We tracked the thermal stability of encapsulated modules by heating at 85°C in a N₂
glove box. The module without CDB degraded down to 10% of the initial PCE after 312
hours, whereas the module with CDB still maintained >70% of its initial performance
(Fig. 4B). For modules without CDB, erosion of the metal electrode was observed after
thermal aging at the interconnecting regions (fig. S34). We speculated that the
erosion was induced by the halide-metal interdiffusion at the interconnecting regions
(lateral edges of the subcells in the module) because of the direct contact between
perovskite and metal. The halide-metal interdiffusion would have two negative effects:
(i) Metal diffusion into the perovskite absorber could degrade the perovskite and
increase charge carrier recombination, and (ii) the halide diffusion into the
electrode could corrode the metal and reduce its electrical conduction.


To investigate the halide-metal diffusion, we reduced the subcell width in a tandem
module to obtain 1-mm-wide grid cells at the P2 processing step to monitor the I-Ag
interdiffusion through x-ray photoelectron spectroscopy (XPS) characterization
(fig. S35). The devices were then subjected to heating at 85°C in a N₂ glove box for
24 hours. For modules without CDB, Ag was detected by XPS in perovskite films when the
x-ray beam was placed on the perovskite film surface after peeling off the multiple
top layers of C₆₀/ALD-SnO₂/Ag (Fig. 4C and fig. S35). This result indicated that Ag
diffused laterally from the P2 region into the perovskite absorbers, whereas the
halides (I⁻) diffused into the edge metal electrode where they had direct contact with
the perovskite (fig. S36). By contrast, for the CDB-containing modules, no obvious
signals of Ag (or I) were detected in the perovskite layer (or edge metal electrode),
indicating that the I-Ag interdiffusion was effectively suppressed (8).

To illustrate the lateral interdiffusion between the perovskite and metal electrode
more directly, we further fabricated the following structures: (i)
glass/ITO/~400-nm-thick WBG perovskite/Ag and (ii) glass/ITO/WBG perovskite/CDB/Ag,
similar to the lateral structure in P2-scribed regions. After thermal aging at 85°C,
we performed elemental analysis using SEM with energy-dispersive x-ray spectroscopy
(fig. S37). The I-Ag interdiffusion occurred across the direct perovskite/electrode
contact and extended over the entire perovskite/metal layers, whereas the halide-metal
interdiffusion was largely hindered by the use of CDB. Br-Ag interdiffusion was not
observed (figs. S36 and S37), possibly because of the larger electronegativity of Br
and thus stronger Pb-Br bonding (38).

We speculate that the CDB technique is a 
universal approach to enhance the efficiency 
and stability of all types of perovskite solar 
modules. To facilitate mass production in the 
future, development of a green solvent system 
(avoiding the use of toxic DMF) should be con- 
sidered for the manufacturing of all-perovskite 
tandem solar modules (39, 40). 


REFERENCES AND NOTES 


1. G. E. Eperon et al., Science 354, 861-865 (2016).
2. K. Xiao et al., Nat. Energy 5, 870-880 (2020).
3. R. Lin et al., Nat. Energy 4, 864-873 (2019).
4. R. Lin et al., Nature 603, 73-78 (2022).
5. M. A. Green et al., Prog. Photovolt. Res. Appl. 30, 3-12 (2022).
6. J. E. Bishop, J. A. Smith, D. G. Lidzey, ACS Appl. Mater. Interfaces 12, 48237-48245 (2020).
7. X. Peng et al., Adv. Funct. Mater. 27, 1703704 (2017).
8. Y. Deng et al., Nat. Energy 3, 560-566 (2018).
9. M. Yang et al., Nat. Energy 2, 17038 (2017).
10. Z. Yang et al., Sci. Adv. 7, eabg3749 (2021).
11. J. B. Whitaker et al., Sustain. Energy Fuels 2, 2442-2449 (2018).
12. J. Li et al., Joule 4, 1035-1053 (2020).
13. N.-G. Park, CrystEngComm 18, 5977-5985 (2016).
14. Y. Rong et al., Science 361, eaat8235 (2018).
15. Y. Hu et al., Joule 3, 2076-2085 (2019).
16. E. Bi et al., Nat. Commun. 8, 15330 (2017).
17. J. Li, Q. Dong, N. Li, L. Wang, Adv. Energy Mater. 7, 1602922 (2017).
18. H. Gao et al., Sol. RRL 5, 2100814 (2021).
19. C. C. Boyd et al., ACS Energy Lett. 3, 1772-1778 (2018).
20. Y. Deng et al., Nat. Energy 6, 633-641 (2021).
21. Y. Deng et al., Sci. Adv. 5, eaax7537 (2019).
22. S. Chen et al., Science 373, 902-907 (2021).
23. H. Chen et al., Nature 550, 92-95 (2017).
24. N. G. Park, K. Zhu, Nat. Rev. Mater. 5, 333-350 (2020).
25. Z. Liu et al., Nat. Energy 5, 596-604 (2020).
26. E. Bi et al., Joule 3, 2748-2760 (2019).
27. A. F. Palmstrom et al., Joule 3, 2193-2204 (2019).
28. D. Zhao et al., Nat. Energy 3, 1093-1100 (2018).
29. Z. Yang et al., Nat. Commun. 10, 4498 (2019).
30. J. W. Yoo et al., Joule 5, 2420-2436 (2021).
31. S. Jariwala et al., Joule 3, 3048-3060 (2019).
32. Y. Zhao et al., Nat. Commun. 11, 6328 (2020).
33. R. E. Beal et al., Matter 2, 207-219 (2020).
34. K. A. Bush et al., ACS Energy Lett. 3, 428-435 (2018).
35. L. A. Castriotta et al., Adv. Energy Mater. 12, 2103420 (2022).
36. S. H. Reddy, F. Di Giacomo, A. Di Carlo, Adv. Energy Mater. 12, 2103534 (2022).
37. R. Prasanna et al., Nat. Energy 4, 939-947 (2019).
38. J. Yang et al., Nano Energy 54, 218-226 (2018).
39. L. Vesce et al., Sol. RRL 5, 2100073 (2021).
40. R. Swartwout et al., Sol. RRL 6, 2100567 (2021).


ACKNOWLEDGMENTS 


Funding: This work was financially supported by the National
Natural Science Foundation of China (61974063, 61921005, and
U21A2076), the Natural Science Foundation of Jiangsu Province
(BK20202008 and BK20190315), Fundamental Research Funds for
the Central Universities (0213/14380206 and 0205/14380252),
the Frontiers Science Center for Critical Earth Material Cycling
Fund (DLTD2109), the Program for Innovative Talents and
Entrepreneurs in Jiangsu, and the Engineering and Physical
Sciences Research Council, UK (EP/S004947/1). Y.H. acknowledges
support from the National University of Singapore (NUS)
Presidential Young Professorship (R-279-000-617-133 and
R-279-001-617-133). This work was authored in part by SERIS,
a research institute at NUS. SERIS is supported by NUS, the
National Research Foundation Singapore (NRF), the Energy Market
Authority of Singapore (EMA), and the Singapore Economic
Development Board (EDB). R.D.J.O. thanks the Penrose Scholarship for
funding his studentship. Author contributions: H.T. conceived and
directed the overall project. H.T. and H.J.S. supervised the research.
K.X. fabricated all the devices and conducted the characterization.
M.Z. fabricated NBG perovskite films, R.D.J.O. and Y.-H.L. carried out
PLQY measurements, Z.L. carried out grazing-incidence wide-angle
x-ray scattering characterizations, X.L. helped with module
encapsulation and carried out transient photovoltage measurements,
H.L. and R.L. carried out SEM characterizations, and J.X. helped
with the ultraviolet-visible absorption and SEM with energy-
dispersive x-ray characterizations. Y.H., X.W., and J.L. assisted with
data analyses. K.X., Y.-H.L., D.L., Y.H., H.J.S., and H.T. wrote the
manuscript. All authors read and commented on the manuscript.
Competing interests: H.T. and K.X. are inventors on a patent
application related to this work filed by Nanjing University. H.J.S. is
a co-founder, Chief Scientific Officer, and director of Oxford PV Ltd.,
a company that is commercializing perovskite PVs. The other
authors declare that they have no competing interests. Data and
materials availability: All data are available in the main text or the
supplementary materials.


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.abn7696 
Materials and Methods 

Figs. S1 to S38

Tables S1 to S6 

References (41-47) 

Movies S1 and S2 


Submitted 23 December 2021; accepted 5 April 2022 
10.1126/science.abn7696 




23andMe has 
Open Postdoc Positions - Hiring Now! 


online @sciencecareers.org 


Interested in joining our diverse 
team of ever-curious scientists, 
researchers, and many more who 
share one common goal - to make 
a difference in people’s lives? 


ScienceCareers 


Join our team! 


Visit our website for more information: https://research.23andme.com/postdocs/ . °3andMe 


POSTDOC 


Mm 
=) 
wy) 
oe) 
rs 
co 
= 


ee ET | . 


Science Careers helps you advance 
your career. Learn how! 


- Register for a free online account on ScienceCareers.org. 
- Search hundreds of job postings and find your perfect job. 
- Sign up to receive e-mail alerts about job postings that match your criteria. 
- Upload your resume into our database and connect with employers. 
- Watch one of our many webinars on different career topics such as job 
  searching, networking, and more. 
- Download our career booklets, including Career Basics, Careers Beyond the Bench, 
  and Developing Your Skills. 
- Complete an interactive, personalized career plan at "my IDP." 
- Visit our Employer Profiles to learn more about prospective employers. 
- Read relevant career advice articles from our library of thousands. 


Visit ScienceCareers.org today. All resources are free. 


Science Careers 
FROM THE JOURNAL SCIENCE, AAAS 


HARVARD UNIVERSITY 
MCB: Department of Molecular and Cellular Biology 


Postdoctoral Research Positions 


The Department is actively recruiting postdoctoral fellows 
interested in a broad range of topics, including cell biology, 
systems biology, biophysics, biochemistry, developmental 
biology, neurobiology, genetics, microbiology, computational 
biology, and structural biology. 


We are seeking highly creative and motivated postdocs with 
broad interests and outstanding graduate training. 


Location: Cambridge, MA USA 

Primary Contact Details: Department of Molecular and Cellular Biology, www.mcb.harvard.edu 


We encourage applicants from various backgrounds and 
promote diversity throughout the University. 


Postdoctoral appointments are initially 1 year in duration, 
renewable on an annual basis. 


Recent Faculty in the Department 


Postdoc openings in the Bellono lab: 

Explore molecular adaptations underlying unique behaviors. Octopus, sharks, jellyfish, 
photosynthetic animals, walking fish, and more! Background in cell biology, physiology, 
biochemistry, structure, other molecular approaches. 


Nicholas 
Bellono 


The Hekstra lab aims to understand how proteins work as evolving, physical “machines”. 
To do so, we develop crystallographic, computational and other methods. Let us know 
what you are interested in. Experience with computational methods, statistics, physics, and 
(bio)chemistry is a plus. 


Doeke 
Hekstra 


The Prigozhin lab is looking for postdoctoral fellows to be involved in collaborative and 
interdisciplinary work in the area of single-cell and single-molecule biophysics of transmembrane 
signaling. We are particularly interested in cryo-vitrification, all kinds of electron and optical 
microscopy, and G-protein-coupled receptors. Let us know what you are excited about! 


Maxim 
Prigozhin 


The Whipple lab is investigating mechanisms of gene regulation in genomic imprinting, 
functions of imprinted non-coding RNAs in the brain, and therapeutic approaches for 
imprinting disorders. We seek collaborative and cross-disciplinary postdoctoral candidates in 
the areas of epigenetics, genomics, and neurobiology. 


Amanda 
Whipple 


For a Full List of Opportunities and Information on How to Apply, Please Visit: 
https://www.mcb.harvard.edu/job-opportunities/post-doctoral-positions/ 


About Us 


Our Science 

We research and teach how the collective behavior of molecules and cells forms the basis of life. We are driven by a passion 
for discovery and value collaborative approaches to scientific inquiry, where connections between people fuel interdisciplinary 
science and break boundaries across varied experimental systems. Against a backdrop of cutting-edge biological research, 
we work as a team of educators and mentors to inspire and train the next generation of scientists and global citizens. 


Our Community 

At the core of the MCB department is a commitment to foster an environment in which all individuals have the opportunity to 
thrive. It is our shared responsibility to create an inclusive culture, where we support and respect each other as colleagues. 
We embrace a diverse range of perspectives, expertise, identities, experiences, talents, and abilities. By continually 
strengthening this foundation of investing in the well-being of our people, we enable our community’s growth and pursuit of 
the creative and innovative approaches that underlie scientific excellence. 


Harvard University is an equal opportunity employer and all qualified applicants will receive consideration for employment without 
regard to race, color, sex, gender identity, sexual orientation, religion, creed, national origin, ancestry, age, protected veteran status, 
disability, genetic information, military service, pregnancy and pregnancy-related conditions, or other protected status. 


WORKING LIFE 


By Yang Xu 


Second chances 


When I started graduate school, everything looked bright. As an undergraduate in China, I felt 
lucky and honored that a South Korean professor visiting my university offered me a place in 
his lab for a Ph.D. It seemed like an amazing opportunity to get international research experience. 
But soon things started to go downhill. The stipend was just enough to cover food, rent, 
and some necessary bills. Even minor expenses such as paying for clothing and social events 
required me to cut back elsewhere. When I looked into external fellowships, which helped many 
of my Korean classmates cover their living expenses, I was told that my international training wasn’t elite 
enough to qualify. Despite feeling overwhelmed and isolated, I carried on—trapped by my fear of failure. 


Time I could have spent studying I instead dedicated to finding the least expensive groceries 
I could, and cooking to save money. I relied on rice and bread to fill my stomach, but I often 
still felt hungry. I was lonely and madly wanted pleasure to escape from the stress. So, I did 
something I immediately regretted: I had unprotected sex with a man from a dating app. For weeks 
afterward, I was paranoid I might have contracted a sexually transmitted infection. To make 
matters worse, I didn’t have anyone to turn to for support because I hadn’t told anyone I’m gay. 
I worried that being open about my sexuality would bias my co-workers against me, so it 
seemed safer to stay quiet. 

Eventually I went to the doctor 
for a physical. To my relief, my HIV 
test was negative. But the results weren’t all good news. 
Many of my metabolite levels were abnormal, likely because 
of my high-carb diet. I also had high cholesterol, possibly 
because of stress. It was becoming clear that my situation 
was not healthy or sustainable. 

But I had been taught I was not allowed to fail. I feared 
that leaving my Ph.D. program would be the beginning of 
a freefall from which I might not recover. Opportunities to 
work internationally are rare, and I wasn’t sure I would find 
one again. I was also hesitant to talk with my adviser. I 
didn’t want to throw my life problems at him, and I worried 
I would be letting him down. 

Soon, though, I reached my breaking point. I went to my 
adviser and told him I couldn’t continue. To my surprise, he 
wasn’t disappointed in me; instead he said he felt sorry for 
what I was going through and regretted he hadn’t stepped 
in earlier. I realized one more mistake I made was keeping 
my problems to myself. 


“Instead of seeing failure as 
shameful, I could treat it as an 
opportunity to improve.” 


That was 7 years ago. I’m now close to finishing my Ph.D., and I’ve learned that fear of failure 
needn’t dominate or define me. After returning to China and working as a research assistant, 
I applied to Ph.D. programs in the United States, where I hoped I would find a better fit. 
The director of my U.S. program was concerned I might drop out again, but I convinced him the 
first program was not right for me and I still had the passion to do research. Since then I have 
had a great journey of learning and growing in an environment that encourages differing 
viewpoints. I feel valued, respected, and comfortable being open about my sexuality. 

It hasn’t always been smooth sailing. When I failed my first attempt at the qualifying exam, 
all my negative feelings around my first Ph.D. experience came flooding back. 
But instead of trying to ignore them or having 
too many drinks and hoping they would go away, as I had 
done in the past, I faced them. I realized the notion that one 
should never fail is toxic and only creates fear. Instead of 
seeing failure as shameful, I could treat it as an opportunity 
to improve. I watched talks on YouTube to learn how to give 
better presentations. I practiced my writing by imagining 
an impatient reader whom I can only persuade with clear, 
concise language. I turned to my colleagues for help. The 
second time around, I passed my exam. 

I am grateful to my program director, who gave me a second chance. 
And I am proud I managed to leave the graduate school that was not right 
for me and kept the faith that I would rise again. 


Yang Xu is a Ph.D. student at the University of Tennessee, Knoxville. 
Send your career story to SciCareerEditor@aaas.org. 
ARE YOU THE KIND 
WHO WORKS TO HELP 
ALL HUMANKIND? 


Find your next job at ScienceCareers.org 


The relevance of science is at an all-time high these days. For anyone who’s 
looking to get ahead in —or just plain get into— science, there’s no better, more 
trusted resource or authority on the subject than Science Careers. Here you'll 
find opportunities and savvy advice across all disciplines and levels. There’s no 
shortage of global problems today that science can’t solve. Be part of the solution. 


ScienceCareers 


FROM THE JOURNAL SCIENCE, AAAS 


You’re only an essay away! 


Check out our Awards and Prizes that are 
currently seeking submissions. 


Eppendorf & Science Prize for Neurobiology 
Submissions Opened: February 1 
Submissions Close: June 15 


Cell and Molecular Biology 
Genomics, Proteomics, and Systems Biology 
Ecology and Environment 
Molecular Medicine 
Submissions Opened: March 1 
Submissions Close: July 15 


Michelson Philanthropies & Science: Immunology 
Submissions Open: May 1 
Submissions Close: October 1 


BII & Science Prize for Innovation: Scientific Innovation & Entrepreneurship 
Submissions Open: June 1 
Submissions Close: November 1 


Science Awards and Prizes seeks to find the next generation of early-career 
researchers on the cusp of changing the scientific landscape. Start your journey 
by finding the Award or Prize that fits you and find out how to apply today! 


Scan the code to start your application process! 
bit.ly/3qKxMNO 


Science, AAAS 


